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(57) Abstract: Dominant negative alleles of human mismatch repair genes can be used to generate hypermutable cells and organisms. 
Q By introducing these genes into cells and transgenic animals, new cell lines and animal varieties with novel and useful properties can 
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^ and of making various cells mismatch repari defective through the use of chemicals to block mismatch repari in in vivo are disclosed. 
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CHEMICAL INHIBITORS OF MISMATCH REPAIR 

TECHNICAL FIELD OF THE INVENTION 

The invention is related to the area of mutagenesis. In particular it is related to the 
5 field of blocking specific DNA repair processes. 

BACKGROUND OF THE INVENTION 

Mismatch repair (MMR) is a conserved DNA repair process that is involved in 
post-replicative repair of mutated DNA sequences that occurs after genome replication. 

10 The process involves a group of gene products, including the mutS homologs GTBP, 

hMSH2, and hMSH3 and the mutL homologs hMLHl, hPMSl, and hPMS2 (Bronner, C.E. 
et al. (1994) Nature 368:258-261; Papadopoulos, N. et al. (1994) Science 263:1625-1629; 
Leach, F.S. et al. (1993) I Cell 75:1215-1225; Nicolaides, N.C. et al. (1994) Nature 
371 :75-80) that work in concert to correct mispaired mono-, di-, and tri-nucleotides, point 

15 mutations, and to monitor for correct homologous recombination. Germline mutations in 
any of the genes involved in this process results in global point mutations, and instability 
of mono, di and tri-nucleotide repeats (a feature referred to as microsatellite instability 
(MI)), throughout the genome of the host cell. In man, genetic defects in MMR results in 
the predisposition to hereditary nonpolyposis colon cancer, a disease in which tumors 

20 retain a diploid genome but have widespread MI (Bronner, C.E. et al. (1994) Nature 
368:258-261; Papadopoulos, N. et al. (1994) Science 263:1625-1629; Leach, F.S. et al. 
(1993) Cell 75:1215-1225; Nicolaides, N.C. et al. (1994) Nature 371 :75-80; Harfe B.D., 
and S. Jinks-Robertson (2000) An. Rev. Genet. 34:359-399; Modrich, P. (1994) Science 
266:1959-1960). Though the mutator defect that arises from MMR deficiency can affect 

25 any DNA sequence, microsateUite sequences are particularly sensitive to MMR 

abnormalities (Peinado, M.A. et a/.(1992) Proc. Natl. Acad. Sci. USA 89:10065-10069). 
Microsatellite testability is therefore a useful indicator of defective MMR. In addition to 
its occurrence in virtually all tumors arising in HNPCC patients, MI is found in a small 
fraction of sporadic tumors with distinctive molecular and phenotypic properties that is 

30 due to defective MMR (Perucho, M. (1996) Biol. Chetn. 377:675-684). 

MMR deficiency leads to a wide spectrum of mutations (point mutations, 
insertions, deletions, recombination, etc.) that can occur throughout the genome of a host 
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cell. This effect has been found to occur across a diverse array of organisms ranging from 
but not limited to unicellular microbes, such as bacteria and yeast, to more complex 
organisms such as Drosophila and mammals, including mice and humans (Harfe B.D., and 
S. Jinks-Robertson (2000) An. Rev. Genet. 34:359-399; Modrich, P. (1994) Science 
5 266: 1959-1960). The ability to block MMR in a normal host cell or organism can result in 
the generation of genetically altered offspring or sibling cells that have desirable output 
traits for applications such as but not limited to agriculture, pharmaceutical, chemical 
manufacturing and specialty goods. A chemical method that can block the MMR process 
is beneficial for generating genetically altered hosts with commercially valuable output 

10 traits. A chemical strategy for blocking MMR in vivo offers a great advantage over a 

recombinant approach for producing genetically altered host organisms. One advantage is 
that a chemical approach bypasses the need for introducing foreign DNA into a host, 
resulting in a rapid approach for inactivating MMR and generating genetically diverse 
offspring or sib cells. Moreover, a chemical process is highly regulated in that once a host 

15 organism with a desired output trait is generated, the chemical is removed from the host 
and its MMR process would be restored, thus fixing the genetic alteration in subsequent 
generations. The invention described herein is directed to the discovery of small 
molecules that are capable of blocking MMR, thus resulting in host organisms with MI, a 
hallmark of MMR deficiency (Peinado, M.A et al. (1992) Proa Natl. Acad. Sci. USA 

20 89:10065-10069; Perucho, M. (1996) Biol. Chem. 377:675-684; Wheeler, J.M. et al. 
(2000) J. Med. Genet. 37:588-592; Hoang, J.M. etal. (1997) Cancer Res. 57:300-303). 
Moreover, host organisms exhibiting MI are then selected for to identify subtypes with 
new output traits, such as but not limited to mutant nucleic acid molecules, polypeptides, 
biochemicals, physical appearance at the microscopic and/or macroscopic level, or 

25 phenotypic alterations in a whole organism. In addition, the ability to develop MMR 
defective host cells by a chemical agent provides a valuable method for creating 
genetically altered cell hosts for product development. The invention described herein is 
directed to the creation of genetically altered cell hosts via the blockade of MMR using 
chemical agents in vivo. 

30 The advantages of the present invention are further described in the examples and 

figures described within this document. 
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SUMMARY OF THE INVENTION 

The invention provides methods for rendering cells hypermutable by blocking 
MMR activity with chemical agents. 
5 The invention also provides genetically altered cell lines which have mutations 

introduced through interruption of mismatch repair. 

The invention further provides methods to produce an enhanced rate of genetic 
hypermutation in a cell. 

The invention encompasses methods of mutating a gene of interest in a cell, 
10 methods of creating cells with new phenotypes, and methods of creating cells with new 
phenotypes and a stable genome. 

The invention also provides methods of creating genetically altered whole 
organisms and methods of creating whole organisms with new phenotypes. 

These and other objects of the invention are provided by one or more of the 
15 embodiments described below. 

In one embodiment of the invention, a method for screening chemical compounds 
that block mismatch repair (MMR) is provided. An MMR-sensitive reporter gene 
containing an out-of-frame polynucleotide repeat in its coding region is introduced into an 
MMR proficient cell. The cell is grown in the presence of chemicals. Chemicals that alter 
20 the genetic structure of the polynucleotide repeat yield a biologically active reporter gene 
product. Chemicals that disrupt the polynucleotide repeat are identified as MMR blocking 
agents. 

In another embodiment of the invention, an isolated MMR blocking chemical is 
provided. The chemical can block MMR of a host cell, yielding a cell that exhibits an 
25 enhanced rate of hypermutation. 

In another embodiment of the invention, a method is provided for introducing a 
mutation into a gene of interest. A chemical that blocks mismatch repair is added to the 
culture of a cell line. The cells become hypermutable as a result of the introduction of the 
chemical. The cell further comprises a gene of interest. The cell is cultured and tested to 
30 determine whether the gene of interest harbors a mutation. 

In another embodiment of the invention, a method is provided for producing new 
phenotypes of a cell. A chemical that blocks mismatch repair is added to a cell culture. 


WO 02/054856 


4 


PCT/US01/00934 


The cell becomes hypermutable as a result of the introduction of the chemical. The cell is 
cultured and tested for the expression of new phenotypes. 

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a cell in which mismatch repair is blocked via a chemical agent. The chemical 
5 is removed from the cell culture and the cell restores its genetic stability. 

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a cell with blocked mismatch repair and a newly selected phenotype. The 
chemical agent is removed from the cell culture and the cell restores its genetic stability 
and the new phenotype is stable. 
10 In another embodiment of the invention, a chemical method for blocking MMR in 

plants is provided. The plant is grown in the presence of a chemical agent. The plant is 
grown and exhibits an enhanced rate of hypermutation. 

In another embodiment of the invention, a method for screening chemical 
inhibitors of MMR in plants in vivo is provided. MMR-sensitive plant expression vectors 
15 are engineered. The reporter vectors are introduced into plant hosts. The plant is grown in 
the presence of a chemical agent. The plant is monitored for altered reporter gene 
function. 

In another embodiment of the invention, a method is provided for introducing a 
mutation into a gene of interest in a plant. A chemical that blocks mismatch repair is 
20 added to a plant. The plant becomes hypermutable as a result of the introduction of the 
chemical. The plant further comprises a gene of interest. The plant is grown. The plant is 
tested to determine whether the gene of interest harbors a mutation. 

In another embodiment of the invention, a method is provided for producing new 
phenotypes of a plant. A chemical that blocks mismatch repair is added to a plant. The 
25 plant becomes hypermutable as a result of the introduction of the chemical. The plant is 
grown and tested for the expression of new phenotypes. 

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a plant in which mismatch repair is blocked via a chemical agent. The 
chemical is removed from the plant culture and the plant restores its genetic stability. 
30 In another embodiment of the invention, a method is provided for restoring genetic 

stability in a plant with blocked mismatch repair and a newly selected phenotype. The 
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chemical agent is removed from the plant culture and the plant restores its genetic stability 
and the new phenotype is stable. 

These and other embodiments of the invention provide the art with methods that 
can generate enhanced mutability in microbes, organisms of the protista class, insect cells, 
5 mammalian cells, plants, and animals as well as providing cells, plants and animals 
harboring potentially useful mutations. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows diagrams of mismatch repair (MMR) sensitive reporter genes. 
10 Figure 2 shows a screening method for identifying MMR blocking chemicals. 

Figure 3 shows identification of a small chemical that blocks MMR and genetically alters 
the pCAR-OF vector in vivo. 

Figure 4 shows shifting of endogenous microsatellites in human cells induced by a 
chemical inhibitor of MMR. 
15 Figure 5 shows sequence analysis of microsatellites from cells treated with chemical 
inhibitors of MMR with altered repeats. 

Figure 6 shows generation of host organisms with new phenotypes using a chemical 
blocker of MMR. 

Figure 7 shows a schematic diagram of MMR-sensitive reporter gene for plants. 
20 Figure 8 shows derivatives of lead compounds and thereof that are inhibitors of MMR in 
vivo. 

DETAILED DESCRIPTION OF THE INVENTION 

Various definitions are provided herein. Most words and terms have the meaning 
25 that would be attributed to those words by one skilled in the art. Words or terms 
specifically defined herein have the meaning provided in the context of the present 
invention as a whole and as are typically understood by those skilled in the art. Any 
conflict between an art-understood definition of a word or term and a definition of the 
word or term as specifically taught herein shall be resolved in favor of the latter. Headings 
30 used herein are for convenience and are not to be construed as limiting. 

As used herein the term "anthracene" refers to the compound anthracene. However, 
when referred to in the general sense, such as "anthracenes," "an anthracene" or "the 
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• anthracene," such terms denote any compound that contains the fused triphenyl core structure 
of anthracene, i.e., 


wherein R r R 10 are independently hydrogen, hydroxyl, arnino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyLN-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
10 arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an arnino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

15 wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 

heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
20 aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups; 

or wherein any two of R r R 10 can together form a polyether; 

or wherein any two of R t -R 10 can, together with the intervening carbon atoms of the 
anthracene core, form a crown ether. 



regardless of extent of substitution. 


R s R io R 4 

In certain preferred embodiments of the invention, the anthracene has the formula: 



5 
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As used herein, "alky!" refers to a hydrocarbon containing from 1 to about 20 carbon 
atoms. Alkyl groups may straight, branched, cyclic, or combinations thereof. Alkyl groups 
thus include, by way of illustration only, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, 
cyclopentyl, cyclopentylmethyl, cyclohexyl, cyclohexylmethyl, and the like. Also included 
5 within the definition of "alkyl" are fused and/or polycyclic aliphatic cyclic ring systems such 
as, for example, adamantane. As used herein the term "alkenyl" denotes an alkyl group 
having at least one carhon-carbon double bond. As used herein the term "alkynyl" denotes 
an alkyl group having at least one carbon-carbon triple bond. 

In some preferred embodiments, the alkyl, alkenyl, alkynyl, aryl, aryloxy, and 

10 heteroaryl substituent groups described above may bear one or more further substituent 
groups; that is, they may be "substituted". In some preferred embodiments these substituent 
groups can include halogens (for example fluorine, chlorine, bromine and iodine), CN, N0 2 , 
lower alkyl groups, aryl groups, heteroaryl groups, aralkyl groups, aralkyloxy groups, 
guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino groups. In addition, the 

15 alkyl and aryl portions of aralkyloxy, arylalkyl, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, 
and aryloxycarbonyl groups also can bear such substituent groups. Thus, by way of example 
only, substituted alkyl groups include, for example, alkyl groups fluoro-, chloro-, bromo- and 
iodoalkyl groups, aminoalkyl groups, and hydroxyalkyl groups, such as hydroxymethyl, 
hydroxyethyl, hydroxypropyl, hydroxybutyl, and the like. In some preferred embodiments 

20 such hydroxyalkyl groups contain from 1 to about 20 carbons. 

As used herein the term "aryl" means a group having 5 to about 20 carbon atoms and 
which contains at least one aromatic ring, such as phenyl, biphenyl and naphthyl. Preferred 
aryl groups include unsubstituted or substituted phenyl and naphthyl groups. The term 
"aryloxy" denotes an aryl group that is bound through an oxygen atom, for example a phenoxy 

25 group. 

In general, the prefix "hetero" denotes the presence of at least one hetero (i.e., non- 
carbon) atom, which is in some preferred embodiments independently one to three O, N, S, 
P, Si or metal atoms. Thus, the term "heteroaryl" denotes an aryl group in which one or more 
ring carbon atom is replaced by such a heteroatom. Preferred heteroaryl groups include 
pyridyl, pyrimidyl, pyrrolyl, furyl, thienyl, and imidazolyl groups. 

The term "aralkyl" (or "arylalkyl") is intended to denote a group having from 6 to 15 
carbons, consisting of an alkyl group that bears an aryl group. Examples of aralkyl groups 
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In general, the prefix "hetero" denotes the presence of at least one hetero (i.e., non- 
carbon) atom, which is in some preferred embodiments independently one to three O, N, S, 
P, Si or metal atoms. Thus, the term "heteroaryl" denotes an aryl group in which one or more 
ring carbon atom is replaced by such a heteroatom. Preferred heteroaryl groups include 
5 pyridyl, pyrimidyl, pyrrolyl, furyl, thienyl, and imidazolyl groups. 

The term "aralkyl" (or "arylalkyl") is intended to denote a group having from 6 to 15 
carbons, consisting of an alkyl group that bears an aryl group. Examples of aralkyl groups 
include benzyl, phenethyl, benzhydryl and naphthylmethyl groups. 

The term "alkylaryl" (or "alkaryl") is intended to denote a group having from 6 to 15 
10 carbons, consisting of an aryl group that bears an alkyl group. Examples of aralkyl groups 
include methylphenyl, ethylphenyl and methylnaphthyl groups. 

The term "arylsulfonyl" denotes an aryl group attached through a sulfonyl group, for 
example phenylsulfonyl. The term "alkylsulfonyl" denotes an alkyl group attached through 
a sulfonyl group, for example methylsulfonyl. 
15 The term "alkoxycarbonyl" denotes a group of formula -C(=0)-0-R where R is alkyl 

alkenyl, or alkynyl, where the alkyl, alkenyl, or alkynyl portions thereof can be optionally 
substituted as described herein. 

The term "aryloxycarbonyl" denotes a group of formula -C(=0)-0-R where R is aryl, 
where the aryl portion thereof can be optionally substituted as described herein. 
20 The terms "arylalkyloxy" or "aralkyloxy" are equivalent, and denote a group of 

formula -0-R / -R // , where R' is R is alkyl, alkenyl, or alkynyl which can be optionally 
substituted as described herein, and wherein R" denotes a aryl or substituted aryl group. 

The terms "alkylaryloxy" or "alkaryloxy" are equivalent, and denote a group of 
formula -O-R'-R* where R 7 is an aryl or substituted aryl group, and R 7/ is alkyl, alkenyl, or 
25 alkynyl which can be optionally substituted as described herein. 

As used herein, the term "aldehyde group" denotes a group that hears a moiety of 
formula -C(=0)~H. The term "ketone" denotes a moiety containing a group of formula -R- 
C(=0)-R=, where R and R= are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, 
or alkaryl, each of which may be substituted as described herein. 
30 As used herein, the term "ester" denotes a moiety having a group of formula -R- 

C(=0)-0-R= or -R-0-C(=0)-R= where R andR= are independently alkyl, alkenyl alkynyl, 
aryl, heteroaryl, aralkyl, or alkaryl, each of which may be substituted as described herein. 
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The term "ether" denotes a moiety having a group of formula -R-0-R= or where R and 
R= are independently alkyl, alkenyL alkynyl, aryl, heteroaryl, aralkyl, or alkaryl, each of 
which may be substituted as described herein. 

The term "crown ether" has its usual meaning of a cyclic ether containing several 
5 oxygen atoms. As used herein the term "organosulfur compound" denotes aliphatic or 
aromatic sulfur containing compounds, for example thiols and disulfides. The term 
"organometallic group" denotes an organic molecule containing at least one metal atom. 

The term "organosilicon compound" denotes aliphatic or aromatic silicon containing 
compounds, for example alkyl and aryl silanes. 
10 The term "carboxylic acid" denotes a moiety having a carboxyl group, other than an 

amino acid. 

As used herein, the term "amino acid" denotes a molecule containing both an amino 
group and a carboxyl group. In some preferred embodiments, the amino acids are a-, p-, y- 
or S-amino acids, including their stereoisomers and racemates. As used herein the term 'T- 
IS amino acid" denotes an a-amino acid having the L configuration around the a-carbon, that is, 
a carboxylic acid of general formula CH(COOH)(NH 2 )-(side chain), having the L- 
configuration. The term "D-amino acid" similarly denotes a carboxylic acid of general 
formula CH(COOH)(NH 2 )-(side chain), having the D-configuration around the a-carbon. Side 
chains of L-amino acids include naturally occurring and non-naturally occurring moieties. 
20 Non-naturally occurring (i.e., unnatural) amino acid side chains are moieties that are used in 
place of naturally occurring amino acid side chains in, for example, armno acid analogs. See, 
for example, Lehninger, Biochemistry, Second Edition, Worth Publishers, hie, 1975, pages 
72-77, incorporated herein by reference. Amino acid substituents may be attached through 
their carbonyl groups through the oxygen or carbonyl carbon thereof, or through their amino 
25 groups, or through functionalities residing on their side chain portions. 

As used herein "polynucleotide" refers to a nucleic acid molecule and includes 
genomic DNA cDNA, RNA, mRNA and the like. 

As used herein "antisense oligonucleotide" refers to a nucleic acid molecule that is 
complementary to at least aportion of a target nucleotide sequence of interest and specifically 
30 hybridizes to the target nucleotide sequence under physiological conditions. 

As used herein "inhibitor of mismatch repair" refers to an agent that interferes with at 
least one function of the mismatch repair system of a cell and thereby renders the cell more 
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susceptible to mutation. 

As used herein "hypermutable" refers to a state in which a cell in vitro or in vivo is 
made more susceptible to mutation through a loss or impairment of the mismatch repair 
system. 

5 As used herein "agents," "chemicals," and "inhibitors" when used in connection 

with inhibition of MMR refers to chemicals, oligonucleotides, analogs of natural 
substrates, and the like that interfere with normal function of MMR. 

Methods for developing hypermutable cells and whole organisms have been 
discovered by taking advantage of the conserved mismatch repair (MMR) process of a 

10 host. Dominant negative alleles of MMR genes, when introduced into cells or transgenic 
animals, increase the rate of spontaneous mutations by reducing the effectiveness of DNA 
repair and thereby render the cells or animals hypermutable. Hypermutable microbes, 
protozoans, insects, mammalian cells, plants or whole animals can then be utilized to 
develop new mutations in a gene of interest. It has been discovered that chemicals that 

15 block MMR, and thereby render cells hypermutable, is an efficient way to introduce 
mutations in cells and genes of interest. In addition to destabilizing the genome of cells 
exposed to chemicals that inhibit MMR activity may be done transiently, allowing cells to 
become hypermutable, and removing the chemical exposure after the desired effect (e.g., a 
mutation in a gene of interest) is achieved. The chemicals that inhibit MMR activity that 

20 are suitable for use in the invention include, but are not limited to, anthracene derivatives, 
nonhydrolyzable ATP analogs, ATPase inhibitors, antisense oligonucleotides that 
specifically anneal to polynucleotides encoding mismatch repair proteins, DNA 
polymerase inhibitors, and exonuclease inhibitors. These chemicals can enhance the rate 
of mutation due to inactivation of MMR yielding clones or subtypes with altered 

25 biochemical properties. Methods for identifying chemical compounds that inhibit MMR 
in vivo are also described herein. 

The process of MMR, also called mismatch proofreading, is carried out by a group 
of protein complexes in cells ranging from bacteria to man (Harfe B.D., and S. Jinks- 
Robertson (2000) An. Rev. Genet. 34:359-399; Modrich, P. (1994) Science 

30 266:1959-1960). An MMR gene is a gene that encodes for one of the proteins of such a 
mismatch repair complex. Although not wanting to be bound by any particular theory of 
mechanism of action, an MMR complex is believed to detect distortions of the DNA helix 
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resulting from non-complementary pairing of nucleotide bases. The non-complementary 
base on the newer DNA strand is excised, and the excised base is replaced with the 
appropriate base, which is complementary to the older DNA strand. lh this way, cells 
eliminate many mutations that occur as a result of mistakes in DNA replication. 
5 Dominant negative alleles cause an MMR defective phenotype even in the presence 

of a wild-type allele in the same cell. An example of a dominant negative allele of an 
MMR gene is the human gene hPMS2-134 (SEQ ID NO:25), which carries a truncating 
mutation at codon 134 (Nicolaides, N.C. etal. (1998) Mol. Cell. Biol. 18:1635-1641). The 
mutation causes the product of this gene to abnormally terminate at the position of the 

10 ' 134th amino acid, resulting in a shortened polypeptide containing the N-terminal 133 

amino acids (SEQ ED NO:24). Such a mutation causes an increase in the rate of mutations, 
which accumulate in cells after DNA replication. Expression of a dominant negative allele 
of a mismatch repair gene results in impairment of mismatch repair activity, even in the 
presence of the wild-type allele. 

15 The MMR process has been shown to be blocked by the use of nonhydrolyzable 

forms of ATP (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325-2331; Allen, DJ. et al. 
(\991)EMBOJ. 16:4467-4476; Bjornson, K.P. a/. (2000) Biochem. 39:3176-3183). 
However, it has not been demonstrated that chemicals can block MMR activity in cells. 
Such chemicals can be identified by screening cells for defective MMR activity. Cells 

20 from bacteria, yeast, fungi, insects, plants, animals, and humans can be screened for 
defective mismatch repair. Genomic DNA, cDNA, or mRNA from any cell can be 
analyzed for variations from the wild type sequences in cells or organisms grown in the 
presence of MMR blocking compounds. Various techniques of screening can be used. 
The suitability of such screening assays, whether natural or artificial, for use in identifying 

25 hypermutable cells, insects, fungi, plants or animals can be evaluated by testing the 

mismatch repair activity caused by a compound or a mixture of compounds, to determine 
if it is an MMR inhibitor. 

A cell, a microbe, or a whole organism such as an insect, fungus, plant or animai in 
which a chemical inhibitor of mismatch repair has been treated will become hypermutable. 

30 This means that the spontaneous mutation rate of such cells or whole organism is elevated 
compared to cells or animals without such treatment. The degree of elevation of the 
spontaneous mutation rate can be at least 2-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100- 
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fold, 200-fold, 500-fold, or 1000-fold that of the normal cell or animal. The use of 
chemical mutagens such as, but limited to, N-methyl-N'-m^o-N-mtrosoguanidine 
(MNNG), methane sulfonate, dimethyl sulfonate, 06-methyl benzadine, ethyl 
methanesulfonate (EMS), methylnitrosourea (MNU), ethylnitrosourea (ENU), etc. can be 
5 used in MMR defective cells or whole organisms to increase the rates an additional 1 0 to 
100 fold that of the MMR deficiency itself. 

According to one aspect of the invention, a screening assay for identifying 
chemical inhibitors of MMR is developed and employed. A chemical compound can be in 
any form or class ranging from but not limited to amino acid, steroidal, aromatic, or lipid 
10 precursors. The chemical compound can be naturally occurring or made in the laboratory. 
The screening assay can be natural such as looking for altered endogenous repeats within 
an host organism's genome (as demonstrated in Figs. 4 and 5), or made in the laboratory 
using an MMR-sensitive reporter gene as demonstrated in Figs. 1-3). 

The chemical compound can be introduced into the cell by supplementing the 
15 growth medium, or by intracellular delivery such as but not limited to using microinjection 
or carrier compounds. 

According to another aspect of the invention, a chemical compound from the 
anthracene class can be exposed to MMR proficient cells or whole organism hosts, the host 
is grown and screened for subtypes containing genetically altered genes with new 
20 biochemical features. 

The anthracene compounds that are suitable for use in the invention include, but 
are not limited to anthracenes having the formula: 



wherein R r R 10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
25 substituted alkenyl, alkynyl, substituted alkyrryl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryL aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
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an amino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxy! 
groups; 

5 wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 

heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryL heteroaryl, 
10 aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and arnino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups; 

or wherein any two of Rj-Rio can together form a polyether; 
or wherein any two of R r R 10 can, together with the intervening carbon atoms of the 
15 anthracene core, form a crown ether. 

The method of the invention also encompasses inhibiting MMR with an anthracene 
of the above formula wherein Rj and Rg are hydrogen, and the remaining substituents are as 
described above. 

The some embodiments, in the anthracene compound R r R, 0 are independently 
20 hydrogen, hydroxyl, alkyl, aryL arylaklyl, or hydroxyalkyl. In other embodiments, R r R I0 are 

independently hydrogen, hydroxyl, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, 

tolyl, hydroxymethyl, hydroxypropyl, orhydroxybutyl. 

In specific embodiments of the invention the anthracenes include, but are not 

limited to 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 
25 9, 1 0-diphenylanthracene, 9, 1 0-dmydroxymethylanthracene, 9-hydroxymethyl- 1 0- 

methylanthracene, dimethylanthracene-l,2-diol, 9-hydroxymemyl-10-memylanthracene- 

1,2-diol, 9-hydVoxymethyl-10-methylanthracene-3,4-diol, 9, 10-di-m-tolyanthracene,.and 

the like. 

The chiral position of the side chains of the anthracenes is not particularly limited 
30 and may be any chiral position and any chiral analog. The anthracenes may also comprise 
a stereoisomeric forms of the anthracenes and includes any isomeric analog. 
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Examples of hosts are but not limited to cells or whole organisms from human, 
primate, mammal, rodent, plant, fish, reptiles, amphibians, insects, fungi, yeast or 
microbes of prokaryotic origin. 

Yet another aspect of the invention is the use of ATP analogs capable of blocking 
5 ATPase activity required for MMR. MMR reporter cells are screened with ATP 
compound libraries to identify those compounds capable of blocking MMR in vivo. 
Examples of ATP analogs that are useful in blocking MMR activity include, but are not 
limited to, nonhydrolyzable forms of ATP such as AMP-PNP and ATP[gamma]S block 
the MMR activity (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325-2331; Allen, D.J. et al. 

10 (1997) EMBO J. 16:4467-4476; Bjornson K.P. et al. (2000) Biochem. 39:3176-3183). 

Yet another aspect of the invention is the use of nuclease inhibitors that are able to 
block the exonuclease activity of the MMR biochemical pathway. MMR reporter cells are 
screened with nuclease inhibitor compound libraries to identify compounds capable of 
blocking MMR in vivo. Examples of nuclease inhibitors that are useful in blocking MMR 

15 activity include, but are not limited to analogs of N-Ethyhnaleimide, an endonuclease 
inhibitor (Huang, Y.C., et.al. (1995) Arch. Biochem. Biophys. 316:485), heterodimeric 
adenme-cham-acridine compounds, exonulcease HI inhibitors (Belmont P, et.al., Bioorg 
Med Chem Lett (2000) 10:293-295), as well as antibiotic compounds such as 
HeHquinomycin, which have helicase inhibitory activity (Chino, M, etal. J. Antibiot. 

20 (Tokyo) (1998) 51:480-486). 

Another aspect of the invention is the use of DNA polymerase inhibitors that are 
able to block the polymerization required for mismatch-mediated repair. MMR reporter 
cells are screened with DNA polymerase inhibitor compound libraries to identify those 
compounds capable of blocking MMR in vivo. Examples of DNA polymerase inhibitors 

25 that are useful in blocking MMR activity include, but are not limited to, analogs of 
actinomycin D (Martin, S J., etal. (1990) J. Immunol. 145:1859), Aphidicolin 
(Kuwakado, K. etal. (1993) Biochem. Pharmacol. 46:1909) l-(2'-Deoxy-2*-fluoro-beta-L- 
arabmofuranosyl)-5-methyluracil (L-FMAU) (Kukhanova M, etal., Biochem Pharmacol 
(1998) 55:1181-1187), and 2',3'-dideoxyribonucleoside 5'-triphosphates (ddNTPs) (Ono, 

30 K., etal., Biomed Pharmacother (1984) 38:382-389). 

In yet another aspect of the invention, antisense oligonucleotides are administered 
to cells to disrupt at least one function of the mismatch repair process. The antisense 
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polynucleotides hybridize to MMR polynucleotides. Both full-length and antisense 
polynucleotide frgaments are suitable for use. "Antisense polynucleotide fragments" of 
the invention include, but are not limited to polynuclotides that specifically hybridize to an 
MMR encoding RNA (as determined by sequence comparison of nucleotides encoding the 
5 MMR to nucleotides encoding other known molecules). Identification of sequences that 
are substantially unique to MMR-encoding polynucleotides can be ascertained by analysis 
of any publicly available sequence database and/or with any commercially available 
1 sequence comparison programs. Antisense molecules may be generated by any means 
mcluding, but not limited to chemical synthesis, expression in an in vitro transcription 

10 reaction, through expression in a transformed cell comprising a vector that may be 

transcribed to produce antisense molecules, through restriction digestion and isolation, 
through the polymerase chain reaction, and the like. 

Antisense oligonucleotides, or fragments thereof may include the nucleotide 
sequences set forth in SEQ ID NOs:15, 17, 19, 21, 23, 25, 27, and 29 or sequences 

15 complementary or homologous thereto, for example. Those of skill in the art recognize 
that the invention may be predicted using any MMR gene. Specifically, antisense nucleic 
acid molecules comprise a sequence complementary to at least about 10, 15, 25, 50, 100, 
250 or 500 nucleotides or an entire MMR encoding sequence. Preferably, the antisense 
oligonucleotides comprise a sequence complementary to about 15 consecutive nucleotides 

20 of the coding strand of the MMR encoding sequence. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an MMR protein. The 
coding strand may also include regulatory regions of the MMR sequence. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons which 

25 are translated into amino acid residues (e.g., the protein coding region of human PMS2 
corresponds to the coding region SEQ ID NO: 17). In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding an MMR protein. The term "noncoding region" refers to 5' 
and 3' sequences which flank the coding region that are not translated into amino acids 

30 (i.e., also referred to as 5' and 3' untranslated regions (UTR)). 

Preferably, antisense oligonucleotides are directed to regulatory regions of a 
nucleotide sequence encoding an MMR protein, or mRNA corresponding thereto, 


WO 02/054856 


16 


PCT/US01/00934 


including, but not limited to, the initiation codon, TATA box, enhancer sequences, and the 
like. Given the coding strand sequences provided herein, antisense nucleic acids of the 
invention can be designed according to the rules of Watson and Crick or Hoogsteen base 
pairing. The antisense nucleic acid molecule can be complementary to the entire coding 
5 region of an MMR mRNA, but more preferably is an oligonucleotide that is antisense to 
only a portion of the coding or noncoding region of an MMR mRNA. For example, the 
antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of an MMR mRNA. An antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. 

10 Screening is any process whereby a chemical compound is exposed to a cell or 

whole organism. The process of screening can be carried out using but not limited to a 
whole animal, plant, insect, microbe, or by using a suspension of one or more isolated cells 
in culture. The cell can be any type of eukaryotic or prokaryotic cell, including, for 
example, cells isolated from humans or other primates, mammals or other vertebrates, 

15 invertebrates, and single celled organisms such as protozoa, yeast, or bacteria. 

In general, screening will be carried out using a suspension of cells, or a single cell, 
but other methods can also be applied as long as a sufficient fraction of the treated cells or 
tissue is exposed so that isolated cells can be grown and utilized. Techniques for chemical 
screening are well known to those in the art. Available techniques for screening include 

20 cell-based assays, molecular assays, and whole organism-based assays. Compounds can 
be added to the screening assays of the invention in order to identify those agents that are 
capable of blocking MMR in cells. 

The screening assays of the invention provide a system wherein a cell, cells or a 
whole organism is contacted with a candidate compound and then tested to determine 

25 whether mismatch repair has been adversely affected. The method in which MMR is 
analyzed may be any known method, mcluding, but not limited to analysis of the 
molecular sequence of the MMR gene, and analyzing endogenous repeats in the subject's 
genome. Further, the invention provides a convenient assay to analyze the effects of 
candidate agents on reporter genes transfeeted into cells. 

30 MMR-inhibitors identified by the methods of the invention can be used to generate 

new mutations in one or more gene(s) of interest A gene of interest can be any gene 
naturally possessed by a cell line, microbe or whole organism. An advantage of using 
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chemicals rather than recombinant technologies to block MMR are that the process is 
faster, there is no need to produce stable clones with a knocked out MMR gene or a clone 
expressing a dorninant negative MMR gene allele. Another advantage is that host 
organisms need not be screened for integrated knock out targeting vectors or stable 
5 expression of a dorninant negative MMR gene allele. Finally, once a cell, plant or animal 
has been exposed to the MMR-blocking compound and a new output trait is generated, the 
MMR process can be restored by removal of compound. Mutations can be detected by 
analyzing the genotype of the cell, or whole organism, for example, by examining the 
sequence of genomic DNA, cDNA, messenger RNA, or amino acids associated with the 

10 gene of interest. Mutations can also be detected by screening for new output traits such as 
hypoxanlhme-guanine phosphoribosyltransferase (HPRT) revertants. A mutant 
polypeptide can be detected by identifying alterations in electrophoretic mobility, 
spectroscopic properties, or other physical or structural characteristics of a protein encoded 
by a mutant gene. One can also screen for altered function of the protein in situ, in isolated 

15 form, or in model systems. One can screen for alteration of any property of the cell, plant 
or animal associated with the function of the gene of interest. 

Several advantages exist in generating genetic mutations by blocking MMR in vivo 
in contrast to general DNA damaging agents such as MNNG, MNU and EMS. Cells with 
MMR deficiency have a wide range of mutations dispersed throughout their entire genome 

20 in contrast to DNA damaging agents such as MNNG, MNU, EMS and ionizing radiation. 
Another advantage is that mutant cells that arise from MMR deficiency are diploid in 
nature and do not lose large segments of chromosomes as is the case of DNA damaging 
agents such as EMS, MNU, and ionizing radiation (Honma, M. et al. (1997) Mutat. Res. 
374:89-98). This unique feature allows for subtle changes throughout a host's genome that 

25 leads to subtle genetic changes yielding genetically stable hosts with commercially 
important output traits. 

The invention also encompasses blocking MMR in vivo and in vitro and further 
exposing the cells or organisms to a chemical mutagen in order to increase the incidence of 
genetic mutation. 

30 The invention also encompasses withdrawing exposure to inhibitors of mismatch 

repair once a desired mutant genotype or phenotype is generated such that the mutations 
are thereafter maintained in a stable genome. 
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The above disclosure generally describes the present invention. A more complete 
understanding can be obtained by reference to the following specific examples, which are 
provided herein for purposes of illustration only, and are not intended to limit the scope of 
1he invention. 

5 EXAMPLES 

EXAMPLE 1: Generation of a cell-based screening assay to identify chemicals 
capable of inactivating mismatch repair in vivo. 

A hallmark of MMR deficiency is the generation of unstable microsatellite repeats 
in the genome of host cells (Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA 

10 89:10065-10069; Strand, M. et al. (1993) Nature 365:274-276; Parsons, R. et al. (1993) 
Cell 75:1227-1236). This phenotype is referred to as microsatellite instability (MI) (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, P. (1994) 
Science 266:1959-1960; Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA 89:10065- 
10069; Perucho, M. (1996) Biol Chem. 377:675-684; Hoang, J.M. et al. (1997) Cancer 

15 Res. 57:300-303; Strand, M. et <z/.(1993) Nature 365:274-276). MI consists of deletions 
and/or insertions within repetitive mono-, di- and/or tri nucleotide repetitive sequences 
throughout the entire genome of a host cell. Extensive genetic analysis of eukaryotic cells 
have found that the only biochemical defect that is capable of producing MI is defective 
MMR (Harfe, B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, 

20 P. (1994) Science 266:1959-1960; Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA 
89:10065-10069; Perucho, M. (1996) Biol. Chem. 377:675-684; Hoang, J.M. et al. (1997) 
Cancer Res. 57:300-303; Strand, M. et c/.(1993) Nature 365:274-276). In light of this 
unique feature that defective MMR has on promoting microsatellite instability, 
endogenous MI is now used as a biochemical marker to survey for lack of MMR activity 

25 witfiin host cells (Hoang, J.M. et al (1997) Cancer Res. 57:300-303). 

A method used to detect MMR deficiency in eukaryotic cells is to employ a 
reporter gene that has a polynucleotide repeat inserted within the coding region that 
disrupts its reading frame due to a frame shift. In the case where MMR is defective, the 
reporter gene will acquire random mutations (i.e., insertions and/or deletions) within the 

30 polynucelotide repeat yielding clones that contain a reporter with an open reading frame. 
This reporter gene can be of any biochemical pathway such as but not limited to 0- 
glucoronidase, p-galactosidase, neomycin resistant gene, hygromycin resistance gene, 
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green fluorescent protein, and the like. A schematic diagram of MMR-sensitive reporters 
are shown in Fig. 1, where the polynucleotide repeat can consist of mono-, di-, tri- or tetra- 
nucleotides. We have employed the use of a P-galactosidase MMR-sensitive reporter gene 
to measure for MMR activity in H3 6 cells, which are a murine hybridoma cell line. The 
5 reporter construct used is called pCAR-OF, which contains a hygromycin resistance 

(HYG) gene plus a P-galactosidase gene with a 29 bp out-of-frame poly-CA tract inserted 
at the 5' end of its coding region. The pCAR-OF reporter cannot generate p-galactosidase 
activity unless a frame-restoring mutation (i.e., insertion or deletion) arises following 
transfection. This line has been shown to be sensitive to inactivated MMR where using a 

10 dominant negative MMR gene allele has found this condition to result in the production of 
P-galactosidase (unpublished data). An example of these data using the dominant 
negative PMS134 allele is shown in Table 1 . Briefly, H36 cells were each transfected with 
an expression vector containing the PMS134 allele (referred to as HB134) or empty vector 
and the pCAR-OF vector in duplicate reactions using the protocol below. The PMS134 

15 gene is cloned into the pEF expression vector, which contains the elongation factor 

promoter upstream of the cloning site followed by a mammalian polyadenylation signal. 
This vector also contains the NEOr gene that allows for selection of cells in G418 to 
identify those retaining this plasmid. Briefly, cells were transfected with 1 ug of the 
PMS134 or empty vector using polyliposomes following the manufacturer's protocol (Life 

20 Technologies). Cells were then selected in 0.5 mg/ml of G418 for 10 days and G418 

resistant cells were pooled together to analyze for gene expression. PMS134 positive cells, 
which were determined by RT-PCR and western blot (not shown) were expanded and 
transfected with the pCAR-OF reporter gene that contains a hygromycin (HYG) resistance 
gene as reporter using the protocol described above. Cells were selected in 0.5 mg/ml 

25 G418 and 0.5mg/ml HYG to select for cells retaining both the MMR effector and the 

pCAR-OF reporter plasmids. All cultures transfected with the pCAR vector resulted in a 
similar number of HYG/G418 resistant cells. Cultures were then expanded and tested for 
P-galactosidase activity in situ as well as by biochemical analysis of cell extracts. For in 
situ analysis, 100,000 cells were harvested and fixed in 1% gluteraldehyde, washed in 

30 phosphate buffered saline solution and incubated in 1 ml of X-gal substrate solution [0.15 
M NaCL 1 mM MgC^, 3.3 mM K,tFe(CN) 6 , 3.3 mM K3Fe(CN) 6 , 0.2% X-Gal ] in 24 well 
plates for 2 hours at 37°C. Reactions were stopped in 500 mM sodium bicarbonate 
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solution and transferred to microscope slides for analysis. Three fields of 200 cells each 
were counted for blue (p-galactosidase positive cells) or white (P-galactosidase negative 
cells) to assess for MMR inactivation. Table 1 shows the results from these studies. 
While no p-galactosidase positive cells were observed in H36 empty vector cells and 10% 
5 of the cells per field were P-galactosidase positive in HB 1 34 cultures. 


Table 1. P-galactosidase expression of H36 empty vector and HB134 cells transfected 
with pCAR-OF reporter vectors. Cells were transfected with the pCAR-OF reporter 
plasmid. Transfected cells were selected in HYG and G418, expanded and stained with 
10 X-gal solution to measure for P-galactosidase activity (blue colored cells). 3 fields of 200 
cells each were analyzed by microscopy. The results below represent the mean +/- 
standard deviation of these experiments. 


Table 1. 


CELL LINE 

# BLUE CELLS 

H36 empty vector 

0+/-0 

HB134 

20+/- 3 


15 

Cultures can be further analyzed by biochemical assays using cell extracts to measure p- 
galactosidase activity as previously described (Nicolaides, N.C. et al. (1998) Mol. Cell. 
Biol. 18:1635-1641). 

The data described in Table 1 show that by inhibiting the MMR activity of an 
20 MMR proficient cell host can result in MI and the altering of microsatelhtes in the pCAR- 
OF vector results in cells that produce functional P-galactosidase enzyme. The use of the 
H36pCAR-OF cell line can now be used to screen for chemicals that are able to block 
MMR of the H36 cell line. 


25 EXAMPLE 2: Screening assays for identifying chemical blockers of MMR. 

A method for screening chemical libraries is provided in this example using the 
H36pCAR-OF cell line described in Example 1. This cell line is a hardy, stable line that 
can be formatted into 96- well microliter plates for automated screening for chemicals that 
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specifically block MMR. An overview of the screening process is given in Figure 2, 
however, the process is not limited to the specifications within this example. Briefly, 
10,000 cells in a total volume of 0.1ml of growth medium (RPMI1640 plus 10% fetal 
bovine serum) are added to 96-well microtiter plates containing any variety of chemical 
5 compounds. Cells are grown for 14-17 days at 37°C in 5%C0 2 . Cells are then lysed in the 
growth medium with 50uls of lysis buffer containing 0.1 MTris buffer (pH 8.0), 0.1% 
Triton X-100, 45 mM 2-mercaptoethanol, ImM MgClj, 0.1 M NaP0 4 and 0.6 mg/ml 
Chlorophenol-red- P-D-galactopyranoside (CPRG, Roche). Reactions are incubated for 1 
hour, terminated by the addition of 50 uls of 0.5 M Na^Os, and analyzed by 

10 spectrophotometry at 576 nm. 

Experimental wells are compared to untreated or vehicle treated wells to identify 
those with increased p-galactosidase activity. Compounds producing MMR blocking 
activity are then nirther analyzed using different cell lines containing the pCAR-OF 
plasmid to measure the ability to block MMR as determined by MI in MMR proficient 

15 hosts by analyzing endogenous microsatellites for instability using assays described below. 

EXAMPLE 3: Defining MMR blocking chemicals. 

The identification of chemical inhibitors of MMR can be difficult in determining 
those that are standard mutagens from those that induce genomic instability via the 

20 blockade of MMR. This Example teaches of a method for detennining blockers of MMR 
from more general mutagens. Once a compound has been identified in the assay 
described above, one can determine if the compound is a general mutagen or a speific 
MMR blocker by monitoring mutation rates in MMR proficient cells and a controlled 
subclone that is MMR defective. One feature of MMR deficiency is the increased 

25 resistance to toxicity of DNA alkylating agents that allows for enhanced rates of mutations 
upon mutagen exposure (Liu, L., et.al. Cancer Res (1996) 56:5375-5379). This unique 
feature allows for the use of a MMR proficient cell and a controlled line to measure for 
enhanced activity of a chemical compound to induce mutations in MMR proficient vs 
MMR deficient lines. If the compound is a true inhibitor of MMR then genetic mutations 

30 should occur in MMR proficient cells while no "enhanced " mutation rate will be found in 
already MMR defective cells. Using these criteria chemicals such as ICR191, which 
induces frameshift mutations in mammalian cells would not he considered a MMR 
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reporter construct used is called pCAR-OF, which contains a hygromycin resistance 
(HYG) gene plus a p-galactosidase gene with a 29 bp out-of-frame poly-CA tract inserted 
at the 5' end of its coding region. The pCAR-OF reporter cannot generate P-galactosidase 
activity unless a frame-restoring mutation (i.e., insertion or deletion) arises following 
5 transfection. This line has been shown to be sensitive to inactivated MMR where using a 
dominant negative MMR gene allele has found this condition to result in the production of 
P-galactosidase (unpublished data). An example of these data using the dominant 
negative PMS134 allele is shown in Table 1. Briefly, H36 cells were each transfected with 
an expression vector containing the PMS134 allele (referred to as HB134) or empty vector 

10 and the pCAR-OF vector in duplicate reactions using the protocol below. The PMS134 
gene is cloned into the pEF expression vector, which contains the elongation factor 
promoter upstream of the cloning site followed by a marnmalian polyadenylation signal. 
This vector also contains the NEOr gene that allows for selection of cells in G418 to 
identify those retaining this plasmid. Briefly, cells were transfected with 1 ug of the 

15 PMS 1 34 or empty vector using polyliposomes following the manufacturer's protocol (Life 
Technologies). Cells were then selected in 0.5 mg/ml of G418 for 10 days and G418 
resistant cells were pooled together to analyze for gene expression. PMS 134 positive cells, 
which were determined by RT-PCR and western blot (not shown) were expanded and 
transfected with the pCAR-OF reporter gene that contains a hygromycin (HYG) resistance 

20 gene as reporter using the protocol described above. Cells were selected in 0.5 mg/ml 
G418 and 0.5mg/ml HYG to select for cells retaining both the MMR effector and the 
pCAR-OF reporter plasmids. All cultures transfected with the pCAR vector resulted in a 
similar number of HYG/G418 resistant cells. Cultures were then expanded and tested for 
P-galactosidase activity in situ as well as by biochemical analysis of cell extracts. For in 

25 situ analysis, 100,000 cells were harvested and fixed in 1% gluteraldehyde, washed in 
phosphate buffered saline solution and incubated in 1 ml of X-gal substrate solution [0.15 
M NaCl, 1 mM MgCL., 3.3 mM K^CN),* 3.3 mM K 3 Fe(CN) 6 , 0.2% X-Gal ] in 24 well 
plates for 2 hours at 37 C C. Reactions were stopped in 500 mM sodium bicarbonate 
solution and transferred to microscope slides for analysis. Three fields of 200 cells each 

30 were counted for blue (P-galactosidase positive cells) or white (P-galactosidase negative 
cells) to assess for MMR inactivation. Table 1 shows the results from these studies. 
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blocking compound because of its ability to produce enhanced mutation rates in already 
MMR defective cell lines (Chen, W.D., etal. J Natl Cancer Inst. (2000) 92:480-485). 
These screening lines include the but are not limited those in which a dominant negative 
MMR gene has been introduced such as that described in EXAMPLE 1 or those in which 
5 naturally MMR deficient cells such as HCT1 16 has been cured by introduction of a 
complementing MMR gene as described (Chen, W.D., etal. J Natl Cancer Inst. (2000) 
92:480-485). 

EXAMPLE 4: Identification of chemical inhibitors of MMR in vivo. 

10 MMR is a conserved post replicative DNA repair mechanism that repairs point 

mutations and insertion/deletions in repetitive sequences after cell division. The MMR 
requires an ATPase activity for initiation complex recognition and DNA translocation. In 
vitro assays have shown that the use of nonhydrolyzable forms of ATP such as AMP-PNP 
and ATP[gamma]S block the MMR activity (Galio, L. et al. (1999) Nucl. Acids Res. 

15 27:2325-233 1 ; Allen, D J. et al. (1997) EMBO J. 16:4467-4476; Bjornson K.P. et al. 
(2000) Biochem. 39:3176-3183). 

The use of chemicals to inhibit endogenous MMR in vivo has not been 
distinguished in the public domain. In an attempt to identify chemicals that can inhibit 
MMR in vivo, we used our H36pCAR-OF screening assay to screen for chemicals that are 

20 able to cause microsatellite instability and restoration of P-galactosidase activity from the 
pCAR-OF vector, an effect that can only be caused due to MMR deficiency. In our 
screening assays we used a variety of classes of compounds ranging from steroids such as 
pontasterone to potent alkylating agents such as EMS, to kinase and other enzyme 
inhibitors. Screens identified one class of chemicals that were capable of generating |3- 

25 galactosidase positive cells. These molecules were derived from the anthracene class. An 
example of one such anthracene derivative for the purposes of this application is a 
molecule called 9,10-dimethylanthracene, referred to from here on as DMA. Fig. 3 shows 
- the effect of DMA in shifting the pCAR-OF reporter plasmid. ha contrast, general DNA 
alkylating agents such as EMS or MNNG did not result in MI and/or the sMfting of the 

30 polynulceotide tract in the pCAR-OF reporter. 

The most likely explanation for the differences in P-galactosidase activity was that 
the DMA compound disturbed MMR activity, resulting in a higher frequency of mutation 
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within the pCAR-OF reporter and re-establishing the ORF. To directly test the hypothesis 
that MMR was altered, we employ a biochemical assay for MMR with the individual 
clones as described by Nicolaides et al, 1997 (Nicolaides, N.C. et al. (1998) Mol. Cell. 
Biol 18:1635-1641). Nuclear extracts are prepared from the clones and incubated with 
5 heteroduplex substrates containing either a /CA\ insertion-deletion or a G/T mismatch 
under conditions described previously. The /CA\ and G/T heteroduplexes are used to test 
repair from the 3' and 5' directions, respectively as described (Nicolaides, N.C. et al. 
(1998) Mol Cell Biol 18:1635-1641). 

10 Biochemical assays for mismatch repair. 

Enzymatic Repair Assays: 

MMR activity in nuclear extracts is performed as described, using 24 fmol of 

substrate (Nicolaides, N.C. et al. (1998) Mol Cell. Biol. 18:1635-1641). 

Complementation assays are done by adding ~ 100 ng of purified MutLa or MutSa 
15 components to 100 ug of nuclear extract, adjusting the final KC1 concentration to 100 mM 

(Nicolaides, N.C. et al (1998) Mol Cell. Biol 18:1635-1641). The substrates used in 

these experiments contain a strand break 181 nucleotides 5' or 125 nucleotides 3' to the 

mismatch. 

20 Biochemical Activity Assays : 

To demonstrate the direct effect to small molecules on MMR proteins, molecular 
assays such as mismatch binding and MMR complex formation are performed in the 
presence or absence of drug. Briefly, MMR gene cDNAs are PCR amplified using primers 
encompassing the entire coding regions of the known MMR proteins MSH2 (SEQ ID 

25 NO:20), GTBP (SEQ ID NO:26), MLH1 (SEQ ID NO:22), human PMS2 (SEQ ID 
NO:16), mouse PMS2 (SEQ ID NO:14), PMS1 (SEQ ID NO:18), and MHS3 (SEQ ID 
NO:28) from any species with a sense primer containing a T7 promoter and a Kozak 
translation signal as previously described (Nicolaides, N.C. et al. (1998) Mol. Cell. Biol 
18:1635-1641). The coding regions of known MMR proteins include the sequences shown 

30 in Table 3 for mouse PMS2 (SEQ ID NO.15), human PMS2 (SEQ ID NO:17), human 

PMS1 (SEQ ID NO:19), human MSH2 (SEQ ID NO:21), human MLH1 (SEQ ID NO:23), 
and human MSH3 (SEQ ID NO:29). Products are transcribed and translated using the 
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TNT system (Promega). An example of PCR primers and in vitro transcription- 
translation reactions are listed below. 

In vitro transcription-translation: 

5 Linear DNA fragments containing hPMS2 (SEQ ID NO: 1 7) and hMLHl (SEQ ID 

NO:23) cDNA sequences were prepared by PCR, incorporating sequences for in vitro 
transcription and translation in the sense primer. A full-length hMLHl fragment was 
prepared using the sense primer 

5'-ggatcctaatacgactcactatagggagaccaccatgtcgttcgtggcaggg-3' (SEQ ID NO:l)(codons 1-6) 
10 and the antisense primer 5'-taagtcttaagtgctaccaac-3' (SEQ ED NO:2)(located in the 3' 

untranslated region, nt 241 1-2433), using a wild-type hMLHl cDNA clone as template. A 
full-length hPMS2 fragment was prepared with the sense primer 

S'-ggatcctaatacgactcactatagggagaccaccatggaacaattgcctgcgg-S* (SEQ ID NO:3)(codons 1-6) 
and the antisense primer 5'-aggttagtgaagactctgtc-3' (SEQ ID NO:4)(located in 3' 

15 untranslated region, nt 2670-2690) using a cloned HPMS2 cDNA as template. These 
fragments were used to produce proteins via the coupled transcription-translation system 
(Promega). The reactions were supplemented with 3S S-labelled methionine or unlabelled 
methionine. Lower molecular weight bands are presumed to be degradation products 
and/or polypeptides translated from alternative internal methionines. 

20 To study the effects of MMR inhibitors, assays are used to measure the formation 

of MLH1 and PMS2 with or without compound using polypeptides produced in the TNT 
System (Promega) followed by immunoprecipitation (TP). To facilitate the IP, tags may be 
placed at the C-terminus of the PMS2 protein to use for antibody binding or antibodies 
directed to the MMR protein itself can be used for IP . 

25 Immunoprecipitations. 

Immunoprecipitations are performed on in vitro translated proteins by mixing the 
translation reactions with 1 ug of the MLH1 specific monoclonal antibody (mAB) MLH14 
(Oncogene Science, Inc.), a polyclonal antibody generated to codons 2-20 of hPMS2 
described above, or a polyclonal antibody generated to codons 843-862 of hPMS2 (Santa 

30 Cruz Biotechnology, Inc.) in 400 ul of EBC buffer (50 mM Tris, pH 7.5, 0. 1 M NaCl, 

0.5% NP40). After incubation for 1 hr at 4°C, protein A sepharose (Sigma) is added to a 
final concentration of 10% and reactions are incubated at 4°C for 1 hour. Proteins bound 
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to protein A are washed five times in EBC and separated by electrophoresis on 4-20% 
Tris-glycine gels, which are then dried and autoradiographed. 

Compounds that block heterodimerization of mutS or mutL proteins can now be 
identified using this assay. 

5 

EXAMPLE 5: Use of chemical MMR inhibitors yields microsatellite instability in 
human cells 

In order to demonstrate the global ability of a chemical inhibitor of MMR in host 
cells and organisms, we treated human HEK293 cells (referred to as 293 cells) with DMA 

10 and measured for microsatellite instability of endogenous loci using the BAT26 diagnostic 
marker (HoangJ.M. etal. (1997) Cancer Res. 57:300-303). Briefly, 10 s cells were grown 
in control medium or 250 uM DMA, a concentration that is found to be non-toxic, for 14 
to 17 days. Cells are then harvested and genomic DNA isolated using the salting out 
method (Nicolaides, N.C. etal. (1991) Afo/. Cell. Biol. 11:6166-6176.). 

15 Various amounts of test DNAs from HCT1 16 (a human colon epithelial cell line) 

and 293 were first used to determine the sensitivity of our microsatellite test. The BAT26 
alleles are known to be heterogeneous between these two cell lines and the products 
migrate at different molecular weights (Nicolaides personal observation). DNAs were 
diluted by limiting dilution to determine the level of sensitivity of the assay. DNAs were 

20 PCR amplified using the BAT26F: 5'-tgactacttttgacttcagcc-3' (SEQ ID NO:43) and the 
BAT26R: 5'-aaccattcaacatttttaaccc-3' (SEQ ID NO:44) primers in buffers as described 
(Nicolaides, N.C. et al. (1995) Genomics 30:195-206). Briefly 1 pg to 100 ngs of DNA 
were amplified using the following conditions: 94°C for 30 sec, 58°C for 30 sec, 72°C for 
30 sec for 30 cycles. PCR reactions were electrophoresed on 12% polyacrylamide TBE 

25 gels (Novex) or 4% agarose gels and stained with ethidium bromide. These studies found 
that 0. 1 ng of genomic DNA was the limit of detection using our conditions. 

To measure for microsatellite stability in 293 cells grown with or without DMA, 
0.1 ngs of DNA from DMA-treated or control 293 cells were amplified using the reaction 
conditions above. Forty individual reactions were carried out for each sample to measure 

30 for minor alleles. Fig. 4A shows a typical result from these studies whereby BAT26 

alleles were amplified from DMA-treated and untreated cells and analyzed on 12% PAGE 
gels (Novex). Alleles from DMA-treated cells showed the presence of an altered allele 
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(asterisk) that migrated differently from the wild type allele. No altered alleles were found 
in the MMR-proficient control cells as expected since MI only occurs in MMR defective 
cell hosts. To confirm these data, PCRs were repeated using isolated BAT26 products. 
Primers and conditions were the same as described above except that reactions were 
5 amplified for 20 cycles. PCR products were gel-purified and cloned into T-tailed vectors 
(InVitrogen) as suggested by the manufacturer. Recombinant clones from DMA-treated 
and control cells were screened by PCR again using the BAT26 primers. Fifty bacterial 
colonies were analyzed for BAT26 structure by directly adding an aliquot of live bacteria 
to the PCR mix. PCR reactions were carried out as described above, and products were 

10 electrophoresed on 4% agarose gels and stained with ethidium bromide. As shown in 

Figure 4B, microsatellites from DMA-treated cells had alterations (asterisks) that made the 
marker length larger or smaller than the wild type allele found in control cells. 

To confirm that these differences in molecular weight were due to shifts within the 
polynucleotide repeat, a hallmark of defective MMR, five clones from each sample were 

15 sequenced using an ABI automated sequencer with an Ml 3-R primer located in the T-tail 
vector backbone. Sequence analysis revealed that the control cell clone used in our studies 
was homo2ygous for the BAT26 allele with a 26nt polyA repeat. Cells treated with DMA 
found multiple alleles ranging from the wild-type with 26 polyA repeat to shorter alleles 
(24 polyA repeat) and larger alleles (28 polyA repeat) (Fig. 5). 

20 These data corroborate the H36pCAR data in Example 1 and Fig. 3 and 

demonstrates the ability to block MMR with a chemical in a range of hosts. 

Example 6: Chemical inhibitors of MMR generate DNA hypermutability in Plants 
and new phenotypes. 

25 To determine if chemical inhibitors of MMR work across a diverse array of 

organisms, we explored the activity of DMA on Arabidopsis thaliana (AT), a member of 
the mustard plant family, as a plant model system to study the effects of DMA on 
generating MMR deficiency, genome alterations, and new output traits. 

Briefly, AT seeds were sterilized with straight commercial bleach and 100 seeds 

30 were plated in 100mm Murashige and Skoog (MS) phytagar (Life Technology) plates with 
increasing amounts of DMA (ranging from lOOum to 50mM). A similar amount of seeds 
were plated on MS phytagar only or in MS phytagar with increasing amounts of EMS 
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(lOOuM to 50mM), a mutagen commonly used to mutate AT seeds (McCallum, C.M.e* al. 
(2000) Nat. Biotechnol.l%'A55A51). Plates were grown in a temperature-controlled, 
fluorescent-lighted humidifier (Percival Growth Chamber) for 10 days. After 10 days, 
seeds were counted to determine toxicity levels for each compound. Table 2 shows the 
5 number of healthy cells/treatment as determined by root formation and shoot formation. 
Plantlets that were identical to untreated seeds were scored healthy. Seeds with stunted 
root or shoot formation were scored intermediate (inter). Non-germinated seeds were 
scored dead. 

Table 2: Toxicity curve of DMA and EMS on Arabidopsis (per 100 cells) 
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10 

The data in Table 2 show that DMA toxicity occurs at lOmM of continuous 
culture, while toxicity occurs at 250 uM for EMS. Next, 50 seeds were plated in two 
150mm dishes containing 2mM DMA, 250 uM EMS or no drug. Seeds were grown for 10 
days and then 10 plants from each plate were transferred to soil. All plants appeared to be 

15 similar in color and height. Plants were grown at room temperature with daily cycles of 1 8 
hr light and 6 hr dark. After 45 days seeds are harvested from siliques and stored in a 
desiccator at 4°C for 72 hours. Seeds are then sterilized and 100 seeds from each plant is 
sown directly into water-saturated soil and grown at room temperature with daily cycles of 
18 hr light and 6 hr dark. At day 10 phenotypically distinct plants were found in 7 out of 

20 118 DMA treated while no phenotypic difference was observed in 1 50 EMS-treated or 1 50 
control plants. These 7 altered plants were light green in color and appeared to grow 
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slower. Fig. 6 shows a typical difference between the DMA altered plant and controls. 
DMA-exposed plants produced offspring that were yellow in appearance in contrast to 
dark green, which is always found in wild-type plants. In addition, the yellow plants were 
also shorter. After 30 days, most wild-type plants produced flowers and siliques, while the 
5 7 mutants just began flowering. After 45 days, control plants were harvested while mutant 
plants were harvested 10 to 15 days later. No such effects were observed in 150 plantlets 
from EMS treated plants. 

The effect of DMA on MMR was confirmed by monitoring the structure of 
endogenous polynucleotide repeat markers within the plant genome. DNA was extracted 

10 using the DNAzol method following the manufacturer's protocol (Life Technology). 
Briefly, two leaves were harvested from DMA, EMS or untreated plants and DNA was 
extracted. DNAs were quantified by optical density using a BioRad Spectrophotometer. 
hiArabidopsis, a series of poly-A (A) n , (CA) n and (GA) n markers were found as a result of 
EMBL and GenBank database searches of DNA sequence data generated as a result of the 

15 Arabidopsis genome-sequencing project. Two markers that are naturally occurring, 

ATHACS and Ngal28 are used to monitor microsatellite stability using primers described 
(Bell, CJ. and J.R. Ecker (1994) Genomics 19:137-144). ATHACS has a stretch of thirty- 
six adenine repeats (A) 36 whereas Ngal28 is characterized by a di-nucleotide AG repeat 
that is repeated nineteen times (AG) 19 while the Nga280 marker contains a polyAG repeat 

20 marker with 15 dinucleotides. DMA-mediated alterations of these markers are measured 
by a PCR assay. Briefly, the genomic DNA is amplified with specific primers in PCR 
reaction buffers described above using l-10ng plant genomic DNA. Primers for each 
marker are listed below: 
nga280: 

25 nga280-F: 5'-CTGATCTCACGGACAATAGTGC-3' (SEQ ID NO:5) 

nga280-R: 5'-GGCTCCATAAAAAGTGCACC-3' (SEQIDNO:6) 

ngal28: 

ngal28-F: 5'-GGTCTGTTGATGTCGTAAGTCG-3' (SEQIDNO:7) 
30 ngal28-R: 5 '-ATCTTGAAACCTTTAGGGAGGG-3 ' (SEQ ID NO:8) 

ATHACS: 

ATHACS-F: 5 '-AGAAGTTTAGACAGGTAC-3 ' (SEQ ID NO:9) 
ATHACS-R: 5'-AAATGTGCAATTGCCTTC-3' (SEQIDNO:10) 
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Cycling conditions are 94°C for 15 seconds, 55°C for 15 seconds and 72°C for 30 
seconds, conditions that have been demonstrated to efficiently amplify these two markers 
(personal observation, Morphotek). PCR products are analyzed on 3.5% metaphor agarose 
gel in Tris-Acetate-EDTA buffer following staining with ethidium bromide. 
5 Another method used to demonstrate that biochemical activity of a plant host's 

MMR is through the use of reporter genes disrupted by a polynucleotide repeat, similar to 
that described in Example 1 and Fig. 1. Due to the high endogenous P-galactosidase 
background, we engineered a plant compatible MMR-sensitive reporter gene consisting of 
the P-glucoronidase (GUS) gene with a mononucleotide repeat that was inserted just 

10 downstream of the initiation codon. Two reporter constructs were generated. pGUS-OF, 
contained a 20 base adenine repeat inserted just downstream of the initiating methionine 
that resulted in a frameshift, therefore producing a nonfunctional enzyme. The second, 
pGUS-IF, contained a 19 base adenine repeat that retained an open reading frame and 
served as a control for p-glucoronidase activity. Both constructs were generated by PCR 

15 using the pBI-121 vector (Life Technologies) as template. The antisense primer was 

directed to the 3' end of the Nopaline Synthase (NOS) polytermination sequence contained 
within the pBI-121 plasmid and contained a unique EcdBI restriction site to facilitate 
cloning of the vector into the pBI-121 binary vector backbone. The sense primers 
contained a unique BamHL restriction site to facilitate cloning of the chimeric GUS 

20 reporter gene into the pBI-121 binary vector backbone. The primers used to generate each 
reporter are: 

1. sense primer for pGUS-IF (uidA-ATG-polyA-IF) : 

5'- CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA CGT CCT GTA GAA ACC-3' (SEQ 
ID NO: 11) 

2. sense primer for pGUS-OF (uidA-ATG-polyA-OF) : 

5'- CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA ACG TCC TGT AGA AAC C-3' 
{SEQ ID NO: 12) 

3. antisense primer (Nos-term) : 

5'- CCC GAA TTC CCC GAT CTA GTA ACA TAG ATG-3' (SEQ ID NO: 13) 

PCR amplifications were carried out using reaction buffers described above. 
35 Reactions were performed using 1 ng of pBI-121 vector as template (Life Technologies) 
and the appropriate corresponding primers. Amplifications were carried at 94°C for 30 
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seconds, 54°C for 60 seconds and 72°C for 60 seconds for 25 cycles. PCR products of the 
expected molecular weight was gel purified, cloned into T-tailed vectors (InVitrogen), and 
sequenced to ensure authentic sequence using the following primers: CaMV-FORW. [= 5'- 
gat ate tec act gac gta ag-3'] (SEQ ID NO.30) for sequencing from the CaMV promoter 
5 into the 5 ' end of GUS cDNAs; NOSpA-42F [= 5 '-tgt tgc egg tot tgc gat g-3 '] (SEQ ID 
NO:3 1) for sequencing of the NOS tenriinator; NOSpA-Cend-R [= 5'-ccc gat eta gta aca 
tag atg-3'] (SEQ ID NO:32) for sequencing from the NOS terminator into the 3' end of the 
GUS cDNAs; GUS-63F [= 5'-cag tot gga teg cga aaa ctg-3'] (SEQ ID NO:33), GUS-441F 
[= 5'-ggt gat tac cga cga aaa cg-3'] (SEQ ID NO:34), GUS-825F [= 5'-agt gaa ggg cga aca 

10 gtt cc-3'] (SEQ ID NO:35), GUS-1224F [= 5'-gag tat tgc caa cga acc-3'] (SEQ ID NO:36), 
GUS-1596F [= 5'-gta tea ccg cgt ctt tga tc-3'] (SEQ ID NO:37), GUS-265R [= 5'-cga aac 
gca gca cga tac g-3'] (SEQ ID NO:38), GUS-646R [= 5 '-gtt caa cgc tga cat cac c-3'] (SEQ 
ID NO:39), GUS-1033R [= 5'-cat gtt cat ctg ccc agt cg-3'] (SEQ ID NO:40), GUS-1425R 
[= 5'-gct ttg gac ata cca tcc-3'] (SEQ ID NO:41), and GUS-1783R [= 5'-cac cga agt tea tgc 

15 cag-3 '] (SEQ ID NO:42) for the sequence of the full length GUS cDNAs. No mutation 
were found in either the OF or IF version of the GUS cDNA, and the expected frames for 
both cDNAs were also confirmed. pCR-IF-GUS and pCR-OF-GUS plasmids were 
subsequently digested with the BamH I and EcoR I restriction endonucleases, to generate 
DNA fragments containing the GUS cDNA along with the NOS terminator. These 

20 fragments were ligated into the BamH I and the EcoR I sites of the pBI-121 plasmid, 
which was prepared for cloning by cutting it with the same enzymes to release the wild 
type GUS cDNA. The resulting constructs (pBI-IF-GUS andpBI-OF-GUS) were 
subsequently digested with Hind HI and EcoR I to release the DNA fragments 
encompassing the CaMV promoter, the IF or OF GUS cDNA, and the NOS terminator. 

25 Finally, these fragments were ligated into the correspondent restriction sites present in the 
pGPTV-HPT binary vector (ATCC) to obtain the pCMV-IF-GUS-HPT and pCMV-OF- 
GUS-HPT binary vectors. 

The resulting vectors, CMV-OF-GUS-HPT and CMVJF-GUS-HPT now contain 
the CaMV35S promoter from the Cauliflower Mosaic 35 S Virus driving the GUS gene 

30 followed by a NOS terminator and polyadenylation signal (Fig. 7). In addition, this vector 
also contains a hygromycin resistance gene as a selectable marker that is used to select for 
plants containing this reporter. 
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Generation of GUS reporter-expressing Arabidopsis thaliana transgenic plants. 

Agrobacterium tumefaciens bacteria are used to shuttle binary expression vectors 
into plants. To generate P-glucoronidase-expressing^m&Wqpjw thaliana (A. thaliana) 
plants, Agrobacterium tumefaciens cells (strain GV3 101) were electroporated with the 
5 CMV-OF-GUS-HPT or the CMV-IF-GUS-HPT binary vector using methods known by 
those skilled in the art. Briefly, one-month old .4. thaliana (ecotype Columbia) plants 
were infected by immersion in a solution containing 5% sucrose, 0.05% silwet and binary 
vector-transformed Agrobacteria cells for 10 seconds. These plants were then grown at 
25°C under a 16 hour day and 8 hour dark photoperiod. After 4 weeks, seeds (referred to as 

10 Tl) were harvested and dried for 5 days. Thirty thousands seeds from ten CMV-OF-GUS- 
HPT or CMV-IF-GUS-HPT-transformed plants were sown in solid Murashige and Skoog 
(MS) media plates in the presence of 20 ug/ml of hygromycin (HYG). Three hundred 
plants were found to be HYG resistant and represented GUS expressing plants. These 
plants along with 300 control plants were grown in MS media for two weeks and then 

15 transferred to soil. Plants were grown for an additional four weeks under standard 
conditions at which time T2 seeds were harvested. 

To corifirm the integration and stability of the GUS vector in the plant genome, 
gene segregation and PCR analyses were conducted. Commonly, three out of four Tl 
plants transformed by Agrobacteria technology are expected to carry 1he vector inserted 

20 within a single locus and are therefore considered heterozygous for the integrated gene. 
Approximately 75% of the seeds (T2) generated from most of the Tl plants were found 
HYG-resistant and this in accordance with the expected 1 :2: 1 ratio of null (no GUS 
containing plants), hetero2ygous, and homozygous plants, respectively, in self-pollinating 
conditions. To confirm that these plants contained the GUS expression vector, genomic 

25 DNA was isolated from leaves of Tl plants using the DNAzol-mediated technique as 
described above. One ng of genomic DNA was analyzed by polymerase chain reaction 
' (PCR) to confirm the presence of the GUS vector. PCR was carried out for 25 cycles with 
the following parameters: 95°C for 30 seconds; 54°C for 1 minute; and 72°C for 2 minutes 
using primers listed above. Positive reactions were observed in DNA from CMV-OF- 

30 GUS-HPT and CMV-IF-GUS-HPT-transformed plants and not from control (uninfected) 
plants. 
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In order to assess the expression of the GUS in Tl plants, leaf tissue was collected 
from Tl plants, homogenized in liquid nitrogen using glass pestles, and suspended in RLT 
lysing buffer (Qiagen, RNeasy plant RNA extraction kit). Five micrograms of total RNA 
was purified according to the manufacturer's suggested protocol and then loaded onto a 
5 1 .2% agarose gel (lx MOPS buffer, 3% formaldehyde), size-fractionated by 

electrophoresis, and transferred onto N-Hybond+ membrane (Amersham). Each membrane 
was incubated at 55°C in 10 ml of hybridization solution (North2South labeling kit, Pierce) 
containing 100 ng of GUS, tubulin, or HYG probes, which were generated by PCR 
amplification, according to the manufacturer's directions. Membranes were washed three 

10 times in 2x SSC, 0.1% SDS at 55°C, and three times in 2x SSC at ambient temperature. 
Detection was carried out using enhanced chemiluminescence (ECL). GUS message was 
detected in three out of ten analyzed transgenic lines, while no signal was found in the 
control plants. Collectively these studies demonstrated the generation of GUS expressing 
transgenic A tkaliana plants. 

15 To determine the status of MMR activity in host plants, one can measure for the 

production of functional p-glucoronidase by staining plant leaves or roots in situ for P-glu 
activity. Briefly, plant tissue is washed twice with water and fixed in 4 mis of 0.02% 
glutaraldehyde for 15 minutes. Next, tissue is rinsed with water and incubated in X-glu 
solution [0.1M NaP0 4 , 2.5 mM K 3 Fe(CN) 6 , 2.5mM K^FeCCN)* 1.5 mM MgClj, and 1 

20 mg/ml X-GLU (5 bromo-4-chloro-3-indoyl- P-D-glucuronide sodium salt) (Gold 

Biotechnology)] for 6 hours at 37°C. Tissues are then washed twice in phosphate buffered 
saline (PBS) solution, once in 70% ethanol and incubated for 4 hours in methanolracetone 
(3 : 1) for 8 hours to remove chlorophyll. Tissues are then washed twice in PBS and stored 
in PBS with 50% glycerol. Plant tissue with functional GUS activity will stain blue. 

25 The presence of GUS activity in CMV-IF-GUS-HPT plants indicates that the in- 

frame N-terrninus insertion of the poly A repeat does not disrupt the GUS protein function. 
The CMV-OF-GUS-HPT plants treated with DMA, EMS or untreated are tested to 
determine if these plants produce GUS activity. The presence of GUS activity in DMA 
treated plants indicates that the polyA repeat was altered, therefore, resulting in a frame- 

30 restoring mutation. Agents such as EMS, which are known to damage DNA by alkylation 
cannot affect the stability of a polynucleotide repeat. This data indicates that plants are 
defective for MMR, the only process known to be responsible for MI. 
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These data demonstrate the utility and power of using a chemical inhibitor of 
MMR to generate a high degree of genetic alteration that is not capable by means of 
standard DNA damaging drugs. Moreover, this application teaches of the use of reporter 
genes such as GUS-OF in plants to monitor for the MMR activity of a plant host. 

5 

EXAMPLE 7: Use of chemical MMR inhibitors yields microsatellite instability in 
microbes. 

To demonstrate the ability of chemical inhibitors to block MMR in a wide range of 
hosts, we employed the use of Pichia yeast containing a pGUS-OF reporter system similar 

10 to that described in Example 5. Briefly, the GUS-OF and GUS-IF gene, which contains a 
polyA repeat at the N-terminus of the protein was subcloned from the pCRTF-GUS and 
pCR-OF-GUS plasmids into the EcoRI site of the pGP vector, which is a consitutively 
expressed yeast vector containing a zeocin resistance gene as selectable marker. pGP- 
GUS-IF and pGP-GUS-OF vectors were electroporated into competent Pichia cells using 

15 standard methods known by those skilled in the art. Cells were plated on YPD agar (1 Og/L 
yeast extract; 20 g/L peptone; 2% glucose; 1.5% bactoagar) plates containing 100 ug/ml 
zeocin. Recombinant yeast are then analyzed for GUS expression/function by replica 
plating on YPD agar plates containing 100 ug/ml zeocin plus 1 mg/ml X-glu (5-bromo-4- 
chloro-3-indoyl-beta-D-glucuronide sodium salt) and grown at 30°C for 16 hours. On 

20 hundred percent of yeast expressing GUS-IF were found to turn blue in the presence of the 
X-glu substrate while none of the control yeast turned blue. None of the yeast containing 
the GUS-OF turned blue in the presence of the X-glu substrate under normal growth 
conditions. 

To demonstrate the ability of chemicals to block MMR in yeast, GUS-OF and 
25 control cells were incubated with 300 uM DMA, EMS, or no chemical for 48 hours. After 
incubation, yeast were plated on YPD-ZEO-X-GLU plates and grown at 30°C for 16 hours. 
After incubation, a subset of yeast expressing GUS-OF contain blue subclones, while 
none are seen in EMS or control cells. These data demonstrate the ability of chemicals to 
block MMR of microbes in vivo to produce subclones with new output traits. 


30 
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EXAMPLE 8: Classes of other chemicals capable of blocking MMR in vivo 

The discovery of anthracene compounds presents a new method for blocking MMR 
activity of host organisms in vivo. "While 9,10-dimethylanthracene (DMA) was found to 
block MMR in cell hosts, other analogs with a similar chemical composition from this 
5 class are also claimed in this invention. These include anthracene and related analogs such 
as 9,10-diphenylanthracene and 9,10-di-M-tolylanthracene. Myers et al. ((1988) Biochem. 
Biophys. Res. Commun. 151:1441-1445) disclosed that at high concentrations, DMA acts 
as a potent weak mutagen, while metabolized forms of DMA are the "active" ingredients 
in promoting mutation. This finding suggests that metabolites of anthracene-based 

10 compounds may also act as active inhibitors of MMR in vivo. For instance, metabolism of 
anthracene and 9,10-dimethylanthracene by Micrococcus sp., Pseudomonas sp. and 
Bacillus macerans microbes have found a number of anthracene and 9,10- 
dimethylanthracene metabolites are formed. These include anthracene and 9,10- 
dimethylanthracene cis-dihydrodiols, hydroxy-methyl-derivatives and various phenolic 

15 compounds. Bacteria metabolize hydrocarbons using the dioxygenase enzyme system, 
which differs from the mammalian cytochrome P-450 monoxygenase. These findings 
suggest the use of bacteria for biotransfoiming anthracene and DMA for additional MMR 
blocking compounds (Traczewska, T.M. etal. (1991) Acta. Microbiol. Pol. 40:235-241). 
Metabolism studies of DMA by rat-liver microsomal preparations has found that this 

20 molecule is converted to 9-Hydroxymethyl-lO-methylanthracene (9-OHMeMA) and 9,10- 
dihydroxymethyl-anthracene (9,10-DiOHMeA) (Lamparczyk, H.S. et al (1984) 
Carcinogenesis 5:1405-1410). In addition, the 1rans-l,2-dihydro-l,2-dihydroxy derivative 
of DMA (DMA 1,2-diol) was found to be a major metabolite as determined by 
chromatographic, ultraviolet (UV), nuclear magnetic resonance (NMR), and mass spectral 

25 properties. DMA 1 ,2-diol was also created through the oxidation of DMA in an ascorbic 
acid-ferrous sulfate-EDTA system. Other dihydrodiols that are formed from DMA by 
metabolism are the trans-1,2- and 3,4-dihydrodiols of 9-OHMeMA (9-OHMeMA 1,2-diol 
and 9-OHMeMA 3,4-diol) while the further metabolism of DMA 1,2-diol can yield both of 
these dihydrodiols. Finally, when 9-OHMeMA is further metabolized, two main 

30 metabolites are formed; one was identified as 9,10-DiOHMeA and the other appeared to be 
9-OHMeMA 3,4-diol. 
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The metabolism of 9-methylanthracene (9-MA), 9-hydroxymethylanthracene (9- 
OHMA), and 9,10-dimethylanthracene (9,10-DMA) by fungus also has been reported 
(Cerniglia, C.E. et al. (1990) Appl. Environ. Microbiol. 56:661-668). These compounds 
are also useful for generating DMA derivatives capable of blocking MMR. Compounds 9- 
5 MA and 9,10-DMA are metabolized by two pathways, one involving initial hydroxylation 
of the methyl group(s) and the other involving epoxidation of the 1,2- and 3,4- aromatic 
double bond positions, followed by enzymatic hydration to form hydroxymethyl trans- 
dihydrodiols. For 9-MA metabolism, the major metabolites identified are trans- 1,2- 
dihydro-l,2-dihydroxy and trans-3,4-dihydro-3,4-dihydroxy derivatives of 9-MA and 9- 

10 OHMA, whereby 9-OHMA can be further metabolized to trans-1,2- and 3,4-dihydrodiol 
derivatives. Circular dichroism spectral analysis revealed that the major enantiomer for 
each dihydrodiol was predominantly in the S,S configuration, in contrast to the 
predominantly RR configuration of the trans-dihydrodiol formed by mammalian enzyme 
systems. These results indicate that Caenorhabditis elegans metabolizes methylated 

15 anthracenes in a highly stereoselective manner that is different from that reported for rat 
liver microsomes. 

The analogs as listed above provide an example but are not limited to anthracene- 
derived compounds capable of ehciting MMR blockade. Additional analogs that are of 
potential use for blocking MMR are shown in Fig.8. 

20 

Other classes of small molecular weight compounds that are capable of blocking 
MMR in vivo. 

MMR is a multi-step process that involves the formation of protein complexes that 
detect mismatched bases or altered repetitive sequences and interface these mutations with 

25 enzymes that degrade the mutant base and repair the DNA with correct nucleotides. First, 
mismatched DNA is recognized by the mutS heterodimeric complex consisting of MSH2 
and GTBP proteins. The DNA bound mutS complex is then recognized by the mutL 
heterdimeric complex that consists of PMS2 and MLH1 proteins. The mutL complex is 
thought to interface exonucleases with the mismatched DNA site, thus initiating this 

30 specialized DNA repair process. After the mismatched bases are removed, the DNA is 
repaired with a polymerase. 
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There are several steps in the normal process that can be targeted by small 
molecular weight compounds to block MMR. This application teaches of these steps and 
the types of compounds that may be used to block this process. 

5 ATPase inhibitors: 

The finding that nonhydrolyzable forms of ATP are able to suppress MMR in vitro 
also suggest that the use for this type of compound can lead to blockade of MMR in vivo 
and mutation a host organism's genome (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325- 
2331; Allen, D.J. et al. (1997) EMBO J. 16:4467-4476; Bjornson, K.P. et al. (2000) 
10 Biochem. 39:3176-3183). One can use a variety of screening methods described within 
this application to identify ATP analogs that block the ATP-dependent steps of mismatch 
repair z'n, vivo. 


Nuclease inhibitors: 

15 The removal of mismatched bases is a required step for effective MMR (Harfe, 

B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399). This suggests that 
compounds capable of blocking this step can lead to blockade of MMR in vivo and 
mutation a host organism's genome. One can use a variety of screening methods described 
within this application to identify nuclease inhibitors analogs that block the nuclease steps 

20 of mismatch repair in vivo. An example of the types of nuclease inhibitors are but not 
limited to analogs of N-Ethylmaleimide, an endonuclease inhibitor (Huang, Y.C., etal. 
(1995) Arch. Biochem. Biophys. 316:485), heterodimeric adenme-chain-acridine 
compounds, exonulcease JU inhibitors (Belmont P, etal., BioorgMed Chem Lett (2000) 
10:293-295), as well as antibiotic compounds such as HeUquinomycin, which have 

25 helicase inhibitory activity (Chino, M, etal. J. Antibiot. (Tokyo) (1998) 51:480-486). 


Polymerase inhibitors: 

Short and long patch repair is a required step for effective MMR (Modrich, P. 
(1994) Science 266:1959-1960). This suggests that compounds capable of blocking 
30 MMR-associated polymerization can lead to blockade of MMR in vivo and mutation a host 
organism's genome. One can use a variety of screening methods described within this 
application to identify polymerase inhibitors analogs that block the polymerization steps of 
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mismatch repair in vivo. An example of DNA polymerase inhibitors that are useful in 
blocking MMR activity include, but are not limited to, analogs of actinomycin D (Martin, 
S.J., et.al. (1990) J. Immunol. 145:1859), Aphidicolin (Kuwakado, K. et.al. (1993) ' 
Biochem. Pharmacol. 46:1909) l-(2'-Deoxy-2 , -fluoro-beta-L-arabmofuranosyl)-5- 
methyluracil (L-FMAU) (Kukhanova M, etal., Biochem Pharmacol (1998) 55:1181- 
1187), and 2 , ,3'-dideoxyribonucleoside 5'-triphosphates (ddNTPs) (Ono, K., etal., Biomed 
Pharmacother (1984) 38:382-389). 

Chemical Inhibitors of Mismatch Repair Gene Expression 

MMR is a multi-protein process that requires the cooperation of several proteins 
such as but not limited to mutS homologs, MSH2, MSH3, MSH6, GTBP; mutL homologs 
PMS1, PMS2, MLH1; and exonucleases and helicases such as MutH and MutY (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, P. (1994) 
Science 266: 1959-1960). Chemicals capable of blocking the expression of these genes can 
lead to the blockade of MMR. An example of a chemical that is capable of blocking 
MMR gene expression is an oligodeoxynucleotide that can specifically bind and degrade 
an MMR gene message and protein production as described by Chauhan DP, et.al. (Clin 
Cancer Res (2000) 6:3827-383 1). One can use a variety of screening methods described 
within this application to identify inhibitors mat block the expression and/or function of 
MMR genes in vivo. 

DISCUSSION 

The results described herein demonstrate the use of chemicals that can block 
mismatch repair of host organisms in vivo to produce genetic mutations. The results also 
demonstrate the use of reporter systems in host cells and organisms that are useful for 
screening chemicals capable of blocking MMR of the host organism. Moreover, the 
results demonstrate the use of chemical inhibitors to block MMR in mammalian cells, 
microbes, and plants to produce organisms with new output traits. The data presented 
herein provide novel approaches for producing genetically altered plants, microbes, and 
mammalian cells with output traits for commercial applications by inhibiting MMR with 
chemicals. This approach gives advantages over others that require the use of recombinant 
techniques to block MMR or to produce new output traits by expression of a foreign gene. 
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This method will be useful in producing genetically altered host organisms for 
agricultural, chemical manufacturing, pharmaceutical, and environmental applications. 


PMS2 (mouse) (SEQ ID NO: 14) 

5 MEQTEGVSTE CAKAIKPIDG KSVHQICSGQ VILSLSTAVK ELIENSVDAG ATTIDLRLKD 60 
YGVDLIEVSD NGCGVEEENF EGLALKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 
TISTCHGSAS VGTRLVFDHN GKITQKTPYP RPKGTTVSVQ HLFYTLPVRY KEFQRNIKKE 180 
YSKMVQVLQA YCIISAGVRV SCTNQLGQGK RHAWCTSGT SGMKENIGSV FGQKQLQSLI 240 
PFVQLPPSDA VCEEYGLSTS GRHKTFSTFR AS FHSARTAP GGVQQTGSFS SSIRGPVTQQ 300 

10 RSLSLSMRFY HMYNRHQYPF WLNVSVDSE CVDINVTPDK RQILLQEEKL LLAVLKTSLI 360 
GMFDSDANKL NVNQQPLLDV EGNLVKLHTA ELEKPVPGKQ DNSPSLKSTA DEKRVASISR 420 
LREAFSLHPT KEIKSRGPET AELTRSFPSE KRGVLSSYPS DVISYRGLRG SQDKLVSPTD 480 
SPGDCMDREK IEKDSGLSST SAGSEEEFST PEVASSFSSD YNVSSLEDRP SQETINCGDL 540 
DCRPPGTGQS LKPEDHGYQC KALPLARLSP TNAKRFKTEE RPSNVNISQR LPGPQSTSAA 600 

15 EVDVAIKMNK R1VLLEFSLS SLAKRMKQLQ HLKAQNKHEL SYRKFRAK1C PGENQAAEDE 660 
LRKEISKSMF AEMEILGQFN LGFIVTKLKE DLFLVDQHAA DEKYNFEMLQ QHTVLQAQRL 720 
ITPQTLNLTA VNEAVLIENL EIFRKNGFDF VIDEDAPVTE RAKLISLPTS KNWTFGPQDI 780 
DELIFMLSDS PGVMCRPSRV RQMFASRACR KSVMIGTALN ASEMKKLITH MGEMDHPWNC 840 
PHGRPTMRHV ANLDVISQN 859 

20 

PMS2 (mouse cDNA) (SEQ ID NO: 15) 

gaattccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 60 
taacctgtcg tcaggtaacg atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120 

25 gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180 
catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctattg 240 
atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300 
tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360 
aagactatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 420 

30 actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 480 
cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540 
atgtcactat atctacctgc cacgggtctg caagcgttgg gactcgactg gtgtttgacc 600 
ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acagtcagtg 660 
tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 720 

35 aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780 
gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 840 
gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 900 
tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 960 
cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 1020 

40 cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc 1080 
agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 1140 
catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag 1200 
ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1260 
tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctag 1320 

45 atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 1380 
agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatocatct 1440 
ccaggctgag agaggccttt tctcttcatc ctactaaaga gatcaagtct aggggtccag 1500 
agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 1560 
cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620 

50 cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 1680 
gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 1740 
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtggtg 1800 
acctggactg ccgtcctcca ggtacaggac agtccttgaa gccagaagac catggatatc 1860 
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaagacag 1920 

55 aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcag 1980 
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 2040 
tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 2100 
aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 2160 
atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcagt 2220 

60 ttaacctggg atttatagta accaaactga aagaggacct cttcctggtg gaccagcatg 2280 
ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 2340 
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ggctcatcac accccagact ctgaacttaa ctgctgtcaa tgaagctgta ctgatagaaa 2400 

atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 24 60 

ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt ggaccccaag 2520 

atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 2580 

5 gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatgatt ggaacggcgc 2640 

tcaatgcgag cgagatgaag aagctcatca cccacatggg tgagatggac cacccctgga 2700 

actgccccca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcaga 2760 

actgacacac cccttgtagc atagagttta ttacagattg ttcggtttgc aaagagaagg 2820 

ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 2880 

10 catttaaaag cagtgttaag gcaggcatga tggagtgttc ctctagctca gctacttggg 2940 

tgatccggtg ggagctcatg tgagcccagg actttgagac cactccgagc cacattcatg 3000 

agactcaatt caaggacaaa aaaaaaaaga tatttttgaa gccttttaaa aaaaaa 3056 

PMS2 (human) (SEQ ID NO: 16) 

15 MERAESSSTE PAKAIKPIDR KSVHQICSGQ WLSLSTAVK ELVENSLDAG ATNIDLKLKD 60 

YGVDLIEVSD NGCGVEEENF EGLTLKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 

TISTCHASAK VGTRLMFDHN GKIIQKTPYP RPRGTTVSVQ QLFSTLPVRH KEFQRNIKKE 18 0 

YAKMVQVLHA YC11SAGIRV SCTNQLGQGK RQPWCTGGS PSIKENIGSV FGQKQLQSLI 24 0 

PFVQLPPSDS VCEEYGLSCS DALHNLFYIS GFISQCTHGV GRSSTDRQFF F1NRRPCDPA 300 

20 KVCRLVNEVY HMYNRHQYPF WLNISVDSE CVDINVTPDK RQILLQEEKL LLAVLKTSLI 360 

GMFDSDVNKL NVSQQPLLDV EGNLIKMHAA DLEKPMVEKQ DQSPSLRTGE EKKDVSISRL 420 

REAFSLRHTT ENKPHSPKTP EPRRSPLGQK RGMLSSSTSG AISDKGVLRP QKEAVSSSHG 480 

PSDPTDRAEV EKDSGHGSTS VDSEGFSIPD TGSHCSSEYA ASSPGDRGSQ EHVDSQEKAP 540 

ETDDSFSDVD CHSNQEDTGC KFRVLPQPTN LATPNTKRFK KEEILSSSDI CQKLVNTQDM 600 

25 SASQVDVAVK INKKWPLDF SMSSLAKRIK QLHHEAQQSE GEQNYRKFRA KICPGENQAA 660 

EDELRKEISK TMFAEMEIIG QFNLGFIITK LNEDIFIVDQ HATDEKYNFE MLQQHTVLQG 720 

QRLIAPQTLN LTAVNEAVLI ENLEIFRKNG FDFVIDENAP VTERAKLISL PTSKNWTFGP 780 

QDVDELIFML SDSPGVMCRP SRVKQMFASR ACRKSVMIGT ALNTSEMKKL ITHMGEMDHP 840 

WNCPHGRPTM RHIANLGVIS QN 862 

30 

PMS2 (human cDNA) (SEQ ID NO-.17) 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 

ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 

35 aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 

tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 

caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 

tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 

actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 480 

40 agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 

tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 

atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 

cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 

cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780 

45 gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 84 0 

atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 900 

aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960 

tataatcgac accagtatcc' atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 

gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080 

50 gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 

agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200 

gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1260 

aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 

aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1380 

55 atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1440 

gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500 

gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560 

agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1620 

gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1680 

60 tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 17 40 

accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 

aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1860 

aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920 

catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 
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tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 

tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100 

gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2160 

cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220 

gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2280 

tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340 

agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400 

agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 24 60 

cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520 

cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580 

atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2640 

tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700 

atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 27 60 

cttttcaaac c 2771 

PMS1 (human) (SEQ ID NO: 18) 

MKQLPAA.TVR LLSSSQIITS WSWKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60 
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI CCIAEVLITT RTAADNFSTQ 120 
YVLDGSGHIL SQKPSHLGQG TTVTALRLFK NLPVRKQFYS TAKKCKDEIK KIQDLLMSFG 180 
ILKPDLRIVF VHNKAVIWQK SRVSDHKMAL MSVLGTAVMN NMESFQYHSE ESQIYLSGFL 240 
PKCDADHSFT SLSTPERSFI FINSRPVHQK DILKLIRHHY NLKCLKESTR LYPVFFLKID 300 
VPTADVDVNL TPDKSQVLLQ NKESVL1ALE NLMTTCYGPL PSTNSYENNK TDVSAADIVL 360 
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG KNTDDCLNHQ ISIGDFGYGH 420 
CSSEISNIDK NTKNAFQDIS MSNVSWENSQ TEYSKTCFIS SVKHTQSENG NKDHIDESGE 4 80 
NEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP EKSLPCKVSN NNYPIPEQMN 540 
LNEDSCNKKS NVIDNKSGKV TAYDLLSNRV IKKPMSASAL FVQDHRPQFL IENPKTSLED 600 
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKRAIE QESQMSLKDG RKKIKPTSAW 660 
NLAQKHKLKT SLSNQPKLDE LLQSQIEKRR SQNIKMVQIP FSMKNLKINF KKQNKVDLEE 720 
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR LLENHKLPAE PLEKPIMLTE 7 80 
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI KLIPGVSITE NYLEIEGMAN 840 
- CLPFYGVADL KEILNAILNR NAKEVYECRP RKVISYLEGE AVRLSRQLPM YLSKEDIQDI 900 
IYRMKHQFGN EIKECVHGRP FFHHLTYLPE TT 932 

PMS1 (human) (SEQ ID NO:19) 

ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 60 
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 120 
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 180 
atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 240 
tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 300 
acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 360 
gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 420 
ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 480 
cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 540 
taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 600 
atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 660 
aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 720 
tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 780 
tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 840 
caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 900 
agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 960 
ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1020 
aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1080 
cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1140 
ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1200 
aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1260 
tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1320 
gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1380 
atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1440 
gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1500 
atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1560 
ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1620 
ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1680 
caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1740 
ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1800 
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ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1860 
ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920 
aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980 
aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggo agaaaaaaga 2040 
D taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100 
atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2160 
ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2220 1 
acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2280 

m atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2340 

10 aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 2400 
agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 24 60 
aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2520 
cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 2580 
aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2640 

15 ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 2700 
taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 27 60 
aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2820 
agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2880 
taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2940 

20 tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3000 
ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3060 
aac 3 06 3 


25 MSH2 (human) (SEQ ID NO:20) 

MAVQPKETLQ LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA LLAAREVFKT 60 
QGVIKYMGPA GAKNLQSWL SKMNFESFVK DLLLVRQYRV EVYKNRAGNK ASKENDWYLA 120 
YKASPGNLSQ FEDILFGNND MSASIGWGV KMSAVDGQRQ VGVGYVDSIQ RKLGLCEFPD 180 
NDQFSNLEAL LIQIGPKECV LPGGETAGDM GKLRQIIQRG GILITERKKA DFSTKDIYQD 24 0 

30 LNRLLKGKKG EQMNSAVLPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LTTFDFSQYM 300 
KLDIAAVRAL NLFQGSVEDT TGSQSLAALL NKCKTPQGQR LVNQWIKQPL MDKNRIEERL 360 
NLVEAFVEDA ELRQTLQEDL LRRFPDLNRL AKKFQRQAAN LQDCYRLYQG INQLPNVIQA 420 
LEKHEGKHQK LLLAVFVTPL TDLRSDFSKF QEMIETTLDM DQVENHEFLV KPSFDPNLSE 480 
LREIMNDLEK KMQSTLISAA RDLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS 540 

35 TVDIQKNGVK FTNSKLTSLN EEYTKNKTEY EEAQDAIVKE IVN1SSGYVE PMQTLNDVLA 600 
QLDAWSFAH VSNGAPVPYV RPAILEKGQG RIILKASRHA CVEVQDEIAF IPNDVYFEKD 660 
KQMFH1ITGP NMGGKSTYIR QTGVIVLMAQ IGCFVPCESA EVSIVDCILA RVGAGDSQLK 720 
GVSTFMAEML ETASILRSAT KDSLIIIDEL GRGTSTYDGF GLAWA1SEYI ATKIGAFCMF 780 
ATHFHELTAL ANQIPTVNNL HVTALTTEET LTMLYQVKKG VCDQSFGIHV AELANFPKHV 840 

40 IECAKQKALE LEEFQYIGES QGYDIMEPAA KKCYLEREQG EKIIQEFLSK VKQMPFTEMS 900 
EENITIKLKQ LKAEVIAKNN SFVNEIISRI KVTT 934 


MSH2 (human cDNA) (SEQ ID NO:21) 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60 

45 gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120 

gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180 

accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 24 0 

tcaagaccca gggggtgatc aagtacatgg ggccggcagg agcaaagaat ctgcagagtg 300 

ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcg.tcagt 360 

50 atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 420 

atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 480 

acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 540 

agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600 

tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660 

55 aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720 

aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780 

atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 840 

tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900 

aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960 

00 agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020 

aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080 

gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 

agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200 
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aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 12 60 
cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320 
tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380 
ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440 
5 tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 1500 
tcagtgaatt aagagaaata atgaatgact tggaaaagaa gatgcagtca acattaataa 1560 
gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620 
agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680 
actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740 

1U ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800 
ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860 
tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 
catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980 
ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040 

15 aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 2100 
atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2160 
agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220 
aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280 
CtgCaaCGaa a 9 attcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340 

20 atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400 
gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 2460 
ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520 
agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580 
agcatgtaat agagtgtgct aaacagaaag ecctggaact tgaggagttt cagtatattg 264 0 

25 gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700 
agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 27 60 
aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820 
agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 2880 
Cagtaatgga at 9 aa 99t aa tattgataag ctattgtctg taatagtttt atattgtttt 2940 

30 atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000 
atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060 
gctgtaactg aggactgttt gcaattgaca taggcaataa taagtgatgt gctgaatttt 3120 
ataaataaaa tcatgtagtt tgtgg 3145 

35 MLH1 (human) (SEQ ID NO:22) 

MSFVAGVIRR LDETWNRIA AGEVIQRPAN AIKEMIENCL DAKSTSIQVI VKEGGLKLIQ 60 
IQDNGTGIRK EDLDIVCERF TTSKLQSFED LASISTYGFR GEALASISHV AHVTITTKTA 120 
DGKCAYRASY SDGKLKAPPK PCAGNQGTQI TVEDLFYNIA TRRKALKNPS EEYGKILEW 180 
GRYSVHNAGI SFSVKKQGET VADVRTLPNA STVDNIRSIF GNAVSRELIE IGCEDKTLAF 240 

40 KMNGYISNAN YSVKKCIFLL FINHRLVEST SLRKAIETVY AAYLPKNTHP FLYLSLEISP 300 
QNVDVNVHPT KHEVHFLHEE SILERVQQHI ESKLLGSNSS RMYFTQTLLP GLAGPSGEMV 360 
KSTTSLTSSS TSGSSDKVYA HQMVRTDSRE QKLDAFLQPL SKPLSSQPQA IVTEDKTDIS 420 
SGRARQQDEE MLELPAPAEV AAKNQSLEGD TTKGTSEMSE KRGPTSSNPR KRHREDSDVE 4 80 
MVEDDSRKEM TAACTPRRRI INLTSVLSLQ EEINEQGHEV LREMLHNHSF VGCVNPQWAL 540 

45 AQHQTKLYLL NTTKLSEELF YQILIYDFAN FGVLRLSEPA PLFDLAMLAL DSPESGWTEE 600 
DGPKEGLAEY IVEFLKKKAE MLADYFSLEI DEEGNLIGLP LLIDNYVPPL EGLPIFILRL 660 
ATEVNWDEEK ECFESLSKEC AMFYSIRKQY ISEESTLSGQ QSEVPGS1PN SWKWTVEHIV 720 
YKALRSHILP PKHFTEDGNI LQLANLPDLY KVFERC 756 

50 MLH1 (human) (SEQ ID NO:23) 

cttggctctt ctggcgccaa aatgtcgttc gtggcagggg ttattcggcg gctggacgag 60 

acagtggtga accgcatcgc ggcgggggaa gttatccagc ggccagctaa tgctatcaaa 120 

gagatgattg agaactgttt agatgcaaaa tccacaagta ttcaagtgat tgttaaagag 180 

ggaggcctga agttgattca gatccaagac aatggcaccg ggatcaggaa agaagatctg 240 

55 gatattgtat gtgaaaggtt cactactagt aaactgcagt cctttgagga tttagccagt 300 

atttctacct atggctttcg aggtgaggct ttggccagca taagccatgt ggctcatgtt 360 
actattacaa cgaaaacagc tgatggaaag tgtgcataca gagcaagtta ctcagatgga 420 

aaactgaaag cccctcctaa accatgtgct ggcaatcaag ggacccagat cacggtggag 480 

gacctttttt acaacatagc cacgaggaga aaagctttaa aaaatccaag tgaagaatat 540 

60 gggaaaattt tggaagttgt tggcaggtat tcagtacaca atgcaggcat tagtttctca 600 

gttaaaaaac aaggagagac agtagctgat gttaggacac tacccaatgc ctcaaccgtg 660 

gacaatattc gctccatctt tggaaatgct gttagtcgag aactgataga aattggatgt 720 

gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 780 
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aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840 

aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900 

ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960 

gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020 

ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080 

ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140 

agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200 

gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260 

gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320 

ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380 

gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 1440 

cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500 

tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560 

aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt 1620 

gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680 

aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 1740 

ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800 

gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 1860 

tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa 1920 

gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980 

cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 2040 

gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100 

gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160 

tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220 

ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280 

gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 2340 

cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 2400 

cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata 2460 

aataaataga tgtgtcttaa cata 2484 

hPMS2-134 (human) (SEQ ID NO:24) 

MERAESSSTE PAKAIKPIDR KSVHQICSGQ VVLSLSTAVK ELVENSLDAG ATNIDLKLKD 60 

YGVDLIEVSD NGCGVEEENF EGLTLKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 

TISTCHASAK VGT 133 

hPMS2-134 (human cDNA) (SEQ ID NO:25) 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgot 60 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 

ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 

aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 

tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 

caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 

tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
acttga 426 

GTBP (human) (SEQ ID NO:26) 

MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60 

ARSASPPKAK NLNGGLRRSV APAAPTSCDF SPGDLVWAKM EGYPWWPCLV YNHPFDGTFI 120 

REKGKSVRVH VQFFDDSPTR GWVSKRLLKP YTGSKSKEAQ KGGHFYSAKP EILRAMQRAD 180 

EALNKDKIKR LELAVCDEPS EPEEEEEMEV GTTYVTDKSE EDNEIESEEE VQPKTQGSRR 240 

SSRQIKKRRV ISDSESDIGG SDVEFKPDTK EEGSSDEISS GVGDSESEGL NSPVKVARKR 300 

KRMVTGNGSL KRKSSRKETP SATKQATS1S SETKNTLRAF SAPQNSESQA HVSGGGDDSS 360 

RPTVWYHETL EWLKEEKRRD EHRRRPDHPD FDASTLYVPE DFLNSCTPGM RKWWQ1KSQN 420 

FDLVICYKVG KFYELYHMDA LIGVSELGLV FMKGNWAHSG FPEIAFGRYS DSLVQKGYKV 480 

ARVEQTETPE MMEARCRKMA H1SKYDRWR REICRI1TKG TQTYSVLEGD PSENYSKYLL 540 

SLKEKEEDSS GHTRAYGVCF VDTSLGKFFI GQFSDDRHCS RFRTLVAHYP PVQVLFEKGN 600 

LSKETKTILK SSLSCSLQEG LIPGSQFWDA SKTLRTLLEE EYFREKLSDG IGVMLPQVLK 660 

GMTSESDSIG LTPGEKSELA LSALGGCVFY LKKCLIDQEL LSMANFEEYI PLDSDTVSTT 720 

RSGAIFTKAY QRMVLDAVTL NNLEIFLNGT NGSTEGTLLE RVDTCHTPFG KRLLKQWLCA 780 
PLCNHYAIND RLDAIEDLMV VPDKISEWE LLKKLPDLER LLSKIHNVGS PLKSQNHPDS 840 
RAIMYEETTY SKKKIIDFLS ALEGFKVMCK IIGIMEEVAD GFKSKILKQV ISLQTKNPEG 900 
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RFPDLTVELN RWDTAFDHEK ARKTGLITPK AGFDSDYDQA LADIRENEQS LLEYLEKQRN 960 

RIGCRTIVYW GIGRNRYQLE IPENFTTRNL PEEYELKSTK KGCKRYWTKT IEKKLANLIN 1020 

AEERRDVSLK DCMRRLFYNF DKNYKDWQSA VECIAVLDVL LCLANYSRGG DGPMCRPVIL 1080 

LPEDTPPFLE LKGSRHPCIT KTFFGDDFIP NDILIGCEEE EQENGKAYCV LVTGPNMGGK 1140 

5 STLMRQAGLL AVMAQMGCYV PAEVCRLTPI DRVFTRLGAS DRIMSGESTF FVELSETASI 1200 

LMHATAHSLV LVDELGRGTA TFDGTA1ANA WKELAETIK CRTLFSTHYH SLVEDYSQNV 1260 

AVRLGHMACM VENECEDPSQ ETITFLYKFI KGACPKSYGF NAARLANLPE EVIQKGHRKA 1320 

REFEKMNQSL RLFREVCLAS ERSTVDAEAV HKLLTLIKEL 1360 

10 GTBP (human cDNA) (SEQ ID NO:27) 

gccgcgcggt agatgcggtg cttttaggag ctccgtccga cagaacggtt gggccttgcc 60 
ggctgtcggt atgtcgcgac agagcaccct gtacagcttc ttccccaagt ctccggcgct 120 
gagtgatgcc aacaaggcct cggccagggc ctcacgcgaa ggcggccgtg ccgccgctgc 180 
ccccggggcc tctccttccc caggcgggga tgcggcctgg agcgaggctg ggcctgggcc 240 

15 caggcccttg gcgcgctccg cgtcaccgcc caaggcgaag aacctcaacg gagggctgcg 300 
gagatcggta gcgcctgctg cccccaccag ttgtgacttc tcaccaggag atttggtttg 360 
ggccaagatg gagggttacc cctggtggcc ttgtctggtt tacaaccacc cctttgatgg 420 
aacattcatc cgcgagaaag ggaaatcagt ccgtgttcat gtacagtttt ttgatgacag 480 
cccaacaagg ggctgggtta gcaaaaggct tttaaagcca tatacaggtt caaaatcaaa 540 

20 ggaagcccag aagggaggtc atttttacag tgcaaagcct gaaatactga gagcaatgca 600 
acgtgcagat gaagccttaa ataaagacaa gattaagagg cttgaattgg cagtttgtga 660 
tgagccctca gagccagaag aggaagaaga gatggaggta ggcacaactt acgtaacaga 720 
taagagtgaa gaagataatg aaattgagag tgaagaggaa gtacagccta agacacaagg 780 
atctaggcga agtagccgcc aaataaaaaa acgaagggtc atatcagatt ctgagagtga 840 

25 cattggtggc tctgatgtgg aatttaagcc agacactaag gaggaaggaa gcagtgatga 900 
aataagcagt ggagtggggg atagtgagag tgaaggcctg aacagccctg tcaaagttgc 960 
tcgaaagcgg aagagaatgg tgactggaaa tggctctctt aaaaggaaaa gctctaggaa 1020 
ggaaacgccc tcagccacca aacaagcaac tagcatttca tcagaaacca agaatacttt 1080 
gagagctttc tctgcccctc aaaattctga atcccaagcc cacgttagtg gaggtggtga 1140 

30 tgacagtagt cgccctactg tttggtatca tgaaacttta gaatggctta aggaggaaaa 1200 
gagaagagat gagcacagga ggaggcctga tcaccccgat tttgatgcat ctacactcta 1260 
tgtgcctgag gatttcctca attcttgtac tcctgggatg aggaagtggt ggcagattaa 1320 
gtctcagaac tttgatcttg tcatctgtta caaggtgggg aaattttatg agctgtacca 1380 
catggatgct cttattggag tcagtgaact ggggctggta ttcatgaaag gcaactgggc 1440 

35 ccattctggc tttcctgaaa ttgcatttgg ccgttattca gattccctgg tgcagaaggg 1500 
ctataaagta gcacgagtgg aacagactga gactccagaa atgatggagg cacgatgtag 1560 
aaagatggca catatatcca agtatgatag agtggtgagg agggagatct gtaggatcat 1620 
taccaagggt acacagactt acagtgtgct ggaaggtgat ccctctgaga actacagtaa 1680 
gtatcttctt agcctcaaag aaaaagagga agattcttct ggccatactc gtgcatatgg 1740 

40 tgtgtgcttt gttgatactt cactgggaaa gtttttcata ggtcagtttt cagatgatcg 1800 
ccattgttcg agatttagga ctctagtggc acactatccc ccagtacaag ttttatttga 1860 
aaaaggaaat ctctcaaagg aaactaaaac aattctaaag agttcattgt cctgttctct 1920 
tcaggaaggt ctgatacccg gctcccagtt ttgggatgca tccaaaactt tgagaactct 1980 
ccttgaggaa gaatatttta gggaaaagct aagtgatggc attggggtga tgttacccca 2040 

45 ggtgcttaaa ggtatgactt cagagtctga ttccattggg ttgacaccag gagagaaaag 2100 
tgaattggcc ctctctgctc taggtggttg tgtcttctac ctcaaaaaat gccttattga 2160 
tcaggagctt ttatcaatgg ctaattttga agaatatatt cccttggatt ctgacacagt 2220 
cagcactaca agatctggtg ctatcttcac caaagcctat caacgaatgg tgctagatgc 2280 
agtgacatta aacaacttgg agatttttct gaatggaaca aatggttcta ctgaaggaac 2340 

50 cctactagag agggttgata cttgccatac tccttttggt aagcggctcc taaagcaatg 2400 
gctttgtgcc ccactctgta accattatgc tattaatgat cgtctagatg ccatagaaga 24 60 
cctcatggtt gtgcctgaca aaatctccga agttgtagag cttctaaaga agcttccaga 2520 
tcttgagagg ctactcagta aaattcataa tgttgggtct cccctgaaga gtcagaacca 2580 
cccagacagc agggctataa tgtatgaaga aactacatac agcaagaaga agattattga 2640 

55 ttttctttct gctctggaag gattcaaagt aatgtgtaaa attataggga tcatggaaga 2700 
agttgctgat ggttttaagt ctaaaatcct taagcaggtc atctctctgc agacaaaaaa 2760 
tcctgaaggt cgttttcctg atttgactgt agaattgaac cgatgggata cagcctttga 2820 
ccatgaaaag gctcgaaaga ctggacttat tactcccaaa gcaggctttg actctgatta 2880 
tgaccaagct cttgctgaca taagagaaaa tgaacagagc ctcctggaat acctagagaa 2940 

60 acagcgcaac agaattggct gtaggaccat agtctattgg gggattggta ggaaccgtta 3000 
ccagctggaa attcctgaga atttcaccac tcgcaatttg ccagaagaat acgagttgaa 3060 
atctaccaag aagggctgta aacgatactg gaccaaaact attgaaaaga agttggctaa 3120 
tctcataaat gctgaagaac ggagggatgt atcattgaag gactgcatgc ggcgactgtt 3180 
ctataacttt gataaaaatt acaaggactg gcagtctgct gtagagtgta tcgcagtgtt 3240 
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ggatgtttta ctgtgcctgg ctaactatag tcgagggggt gatggtccta tgtgtcgccc 3300 
agtaattctg ttgccggaag ataccccccc cttcttagag cttaaaggat cacgccatcc 3360 
ttgcattacg aagacttttt ttggagatga ttttattcct aatgacattc taataggctg 3420 
tgaggaagag gagcaggaaa atggcaaagc ctattgtgtg cttgttactg gaccaaatat 34 80 
5 ggggggcaag tctacgctta tgagacaggc tggcttatta gctgtaatgg cccagatggg 3540 
ttgttacgtc cctgctgaag tgtgcaggct cacaccaatt gatagagtgt ttactagact 3600 
tggtgcctca gacagaataa tgtcaggtga aagtacattt tttgttgaat taagtgaaac 3660 
tgccagcata ctcatgcatg caacagcaca ttctctggtg cttgtggatg aattaggaag 3720 
aggtactgca acatttgatg ggacggcaat agcaaatgca gttgttaaag aacttgctga 3780 

10 gactataaaa tgtcgtacat tattttcaac tcactaccat tcattagtag aagattattc 3840 
tcaaaatgtt gctgtgcgcc taggacatat ggcatgcatg gtagaaaatg aatgtgaaga 3900 
ccccagccag gagactatta cgttcctcta taaattcatt aagggagctt gtcctaaaag 3960 
ctatggcttt aatgcagcaa ggcttgctaa tctcccagag gaagttattc aaaagggaca 4020 
tagaaaagca agagaatttg agaagatgaa tcagtcacta cgattatttc gggaagtttg 4080 

15 cctggctagt gaaaggtcaa ctgtagatgc tgaagctgtc cataaattgc tgactttgat 4140 
taaggaatta tagactgact acattggaag ctttgagttg acttctgaca aaggtggtaa 4200 
attcagacaa cattatgatc taataaactt tattttttaa aaat 4244 


MSH3 (human) (SEQ ID NO:28) 

20 MSRRKPASGG LAASSSAPAR QAVLSRFFQS TGSLKSTSSS TGAADQVDPG AAAAAAPPAP 60 
AFPPQLPPHV ATEIDRRKKR PLENDGPVKK KVKKVQQKEG GSDLGMSGNS EPKKCLRTRN 120 
VSKSLEKLKE FCCDSALPQS RVQTESLQER FAVLPKCTDF DDISLLHAKN AVSSEDSKRQ 180 
INQKDTTLFD LSQFGSSNTS HENLQKTASK SANKRSKSIY TPLELQYIEM KQQHKDAVLC 240 
VECGYKYRFF GEDAEIAARE LNIYCHLDHN FMTASIPTHR LFVHVRRLVA KGYKVGWKQ 300 

25 TETAALKAIG DNRSSLFSRK LTALYTKSTL IGEDVNPL1K LDDAVNVDEI MTDTSTSYLL. 360 
CISENKENVR DKKKGNIFIG IVGVQPATGE WFDSFQDSA SRSELETRMS SLQPVELLLP 420 
SALSEQTEAL IHRATSVSVQ DDRIRVERMD NIYFEYSHAF QAVTEFYAKD TVDIKGSQI1 480 
SGIVNLEKPV ICSLAAIIKY LKEFNLEKML SKPENFKQLS SKMEFMTING TTLRNLEILQ 540 
NQTDMKTKGS LLWVLDHTKT SFGRRKLKKW VTQPLLKLRE INARLDAVSE VLHSESSVFG 600 

30 QIENHLRKLP D1ERGLCSIY HKKCSTQEFF LIVKTLYHLK SEFQAIIPAV NSHIQSDLLR 660 
TVILEIPELL SPVEHYLKIL NEQAAKVGDK TELFKDLSDF PLIKKRKDEI QGVIDEIRMH 720 
LQEIRKILKN PSAQYVTVSG QEFM1EIKNS AVSCIPTDWV KVGSTKAVSR FHSPFIVENY 780 
RHLNQLREQL VLDCSAEWLD FLEKFSEHYH SLCKAVHHLA TVDCIFSLAK VAKQGDYCRP 840 
TVQEERKIVI KNGRHPVIDV LLGEQDQYVP NNTDLSEDSE RVMIITGPNM GGKSSYIKQV 900 

35 ALITIMAQIG SYVPAEEATI GIVDGIFTRM GAADNIYKGR STFMEELTDT AEIIRKATSQ 960 
SLVILDELGR GTSTHDGIAI AYATLEYFIR DVKSLTLFVT HYPPVCELEK NYSHQVGNYH 1020 
MGFLVSEDES KLDPGAAEQV PDFVTFLYQI TRGIAARSYG LNVAKLADVP GE1LKRAAHK 1080 
SKELEGLINT KRKRLKYFAK LWTMHNAQDL QKWTEEFNME ETQTSLLH 1128 

40 MSH3 (human DNA) (SEQ ID NO:29) 

gggcacgagc cctgccatgt ctcgccggaa gcctgcgtcg ggcggcctcg ctgcctccag 60 
ctcagcccct gcgaggcaag cggttttgag ccgattcttc cagtctacgg gaagcctgaa 120 
atccacctcc tcctccacag gtgcagccga ccaggtggac cctggcgctg cagcggccgc 180 
agcgccccca gcgcccgcct tcccgcccca gctgccgccg cacgtagcta cagaaattga 240 

45 cagaagaaag aagagaccat tggaaaatga tgggcctgtt aaaaagaaag taaagaaagt 300 
ccaacaaaag gaaggaggaa gtgatctggg aatgtctggc aactctgagc caaagaaatg 360 
tctgaggacc aggaatgttt caaagtctct ggaaaaattg aaagaattct gctgcgattc 420 
tgcccttcct caaagtagag tccagacaga atctctgcag gagagatttg cagttctgcc 480 
aaaatgtact gattttgatg atatcagtct tctacacgca aagaatgcag tttcttctga 540 

50 agattcgaaa cgtcaaatta atcaaaagga cacaacactt tttgatctca gtcagtttgg 600 
atcatcaaat acaagtcatg aaaatttaca gaaaactgct tccaaatcag ctaacaaacg 660 
gtccaaaagc atctatacgc cgctagaatt acaatacata gaaatgaagc agcagcacaa 720 
agatgcagtt ttgtgtgtgg aatgtggata taagtataga ttctttgggg aagatgcaga 780 
gattgcagcc cgagagctca atatttattg ccatttagat cacaacttta tgacagcaag 840 

55 tatacctact cacagactgt ttgttcatgt acgccgcctg gtggcaaaag gatataaggt 900 
gggagttgtg aagcaaactg aaactgcagc attaaaggcc attggagaca acagaagttc 960 
actcttttcc cggaaattga ctgcccttta tacaaaatct acacttattg gagaagatgt 1020 
gaatccccta atcaagctgg atgatgctgt aaatgttgat gagataatga ctgatacttc 1080 
taccagctat cttctgtgca tctctgaaaa taaggaaaat gttagggaca aaaaaaaggg 1140 

60 caacattttt attggcattg tgggagtgca gcctgccaca ggcgaggttg tgtttgatag 1200 
tttccaggac tctgcttctc gttcagagct agaaacccgg atgtcaagcc tgcagccagt 1260 
agagctgctg cttccttcgg ccttgtccga gcaaacagag gcgctcatcc acagagccac 1320 
atctgttagt gtgcaggatg acagaattcg agtcgaaagg atggataaca tttattttga 1380 
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atacagccat gctttccagg 
aggttctcaa attatttctg 
tgccatcata aaatacctca 
ttttaaacag ctatcaagta 
5 tctggaaatc ctacagaatc 
agaccacact aaaacttcat 
ccttaaatta agggaaataa 
atctagtgtg tttggtcaga 
actctgtagc atttatcaca 

10 tttatatcac ctaaagtcag 
gtcagacttg ctccggaccg 
ttacttaaag atactcaatg 
agacctttct gacttccctt 
cgagatccga atgcatttgc 

15 tgtgacagta tcaggacagg 
accaactgat tgggtaaagg 
tattgtagaa aattacagac 
tgctgaatgg cttgattttc 
agtgcatcac ctagcaactg 

20 agattactgc agaccaactg 
ccctgtgatt gatgtgttgc 
atcagaggac tcagagagag 
ctacataaaa caagttgcat 
agaagaagcg acaattggga 

25 tatatataaa ggacggagta 
aaaagcaaca tcacagtcct 
tgatggaatt gccattgcct 
aaccctgttt gtcacccatt 
ggtggggaat taccacatgg 

30 cgcagcagaa caagtccctg 
agcaaggagt tatggattaa 
gaaagcagct cacaagtcaa 
caagtatttt gcaaagttat 
ggagttcaac atggaagaaa 

35 tgaacaaaaa atggagaatt 
tatctttgtg tgacatgtga 
agaggttttt ctgaagacag 
aacactcttg aatagacttc 
aaagccttaa gtggcagaat 

40 tgatattttt atttgtttca 
atccattgaa ctaaaataat 
tttttataag tagaaagaat 
tgggaggcca aggtaggcag 
tggcaaaacc ccatctttac 

45 ttagctgggc atggtggcgc 
aatctcttga acctgggagg 
gcttgggcaa cagagcaaga 
caagctttta aaaactagag 
ttgtcatagg attaagcagt 

50 taataaatat ttaatgaata 


cagttacaga gttttatgca 
gcattgttaa cttagagaag 
aagaattcaa cttggaaaag 
aaatggaatt tatgacaatt 
agactgatat gaaaaccaaa 
ttgggagacg gaagttaaag 
atgcccggct tgatgctgta 
tagaaaatca tctacgtaaa 
aaaaatgttc tacccaagag 
aatttcaagc aataatacct 
ttattttaga aattcctgaa 
aacaagctgc caaagttggg 
taataaaaaa gaggaaggat 
aagaaatacg aaaaatacta 
agtttatgat agaaataaag 
ttggaagcac aaaagctgtg 
atctgaatca gctccgggag 
tagagaaatt cagtgaacat 
ttgactgcat tttctccctg 
tacaagaaga aagaaaaatt 
tgggagaaca ggatcaatat 
taatgataat taccggacca 
tgattaccat catggctcag 
ttgtggatgg cattttcaca 
catttatgga agaactgact 
tggttatctt ggatgaacta 
atgctacact tgagtatttc 
atccgccagt ttgtgaacta 
gattcttggt cagtgaggat 
attttgtcac cttcctttac 
atgtggctaa actagcagat 
aagagctgga aggattaata 
ggacgatgca taatgcacaa 
cacagacttc tcttcttcat 
aaaaatacca actgtacaaa 
gcataaaatt atgaccatgg 
tctttttcaa gtttctgtct 
cactttgtaa ttagaaaatt 
ataattccca agcttttgga 
gttcagataa ttggcaactg 
tttattatgc aaccagttta 
tggccaggca tggtggctca 
atcacctgag gtcaggagtt 
taaaaatata aagtacatct 
acacctgtag tcccagctac 
cggaggttgc aatgagccga 
ctccatctca aaaaagaaaa 
cacagaagga ataaggtcat 
ttaaagattg ttggatgaaa 
cttgctataa aaaaaaaaaa 


aaagatacag ttgacatcaa 1440 
cctgtgattt gctctttggc 1500 
atgctctcca aacctgagaa 1560 
aatggaacaa cattaaggaa 1620 
ggaagtttgc tgtgggtttt 1680 
aagtgggtga cccagccact 1740 
tcggaagttc tccattcaga 1800 
ttgcccgaca tagagagggg 1860 
ttcttcttga ttgtcaaaac 1920 
gctgttaatt cccacattca 1980 
ctcctoagtc cagtggagca 2040 
gataaaactg aattatttaa 2100 
gaaattcaag gtgttattga 2160 
aaaaatcctt ctgcacaata 2220 
aactctgctg tatcttgtat 2280 
agccgctttc actctccttt 2340 
cagctagtcc ttgactgcag 2400 
tatcactcct tgtgtaaagc 24 60 
gccaaggtcg ctaagcaagg 2520 
gtaataaaaa atggaaggca 2580 
gtcccaaata atacagattt 2640 
aacatgggtg gaaagagctc 2700 
attggctcct atgttcctgc 2760 
aggatgggtg ctgcagacaa 2820 
gacacagcag aaataatcag 2880 
ggaagaggga cgagcactca 2940 
atcagagatg tgaaatcctt 3000 
gaaaaaaatt actcacacca 3060 
gaaagcaaac tggatccagg 3120 
caaataacta gaggaattgc 3180 
gttcctggag aaattttgaa 324 0 
aatacgaaaa gaaagagact 3300 
gacctgcaga agtggacaga 3360 
taaaatgaag actacatttg 3420 
ataactctcc agtaacagcc 3480 
tatattccta ttggaaacag 3540 
tcctaacttt tctacgtata 3600 
ttatggacag taagtccagt 3660 
gggtgatata aaaatttact 3720 
ggtgaatctg gcaggaatct 3780 
tccaccaaga acataagaat 3840 
tgcctgtaat cccagcactt 3900 
caagaccagc ctggccaaca 3960 
ctactaaaaa tacgaaaaaa 4020 
tccggaggct gaggcaggag 4080 
gatcacgtca ctgcactcca 4140 
aagaaaagaa atagaattat 4200 
gaaatttaaa aggttaaata 4260 
ttatttgtca ttcattcaag 4320 
aaaaaaaaaa aaaa 4374 


Each reference cited herein is hereby incorporated by reference in its entirety. 
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We claim: 

1 . A method for making a hypermutable cell comprising exposing a cell to an inhibitor 
of mismatch repair, wherein said inhibitor is an anthracene, an ATPase inhibitor, a nuclease 
inhibitor, a polymerase inhibitor, or an antisense oligonucleotide that specifically hybridizes 
to a nucleotide encoding a mismatch repair protein. 

2. The method of claim 1 wherein said inhibitor is an anthracene. 

3 . The method of claim 2 wherein said anthracene has the formula: 



wherein R r R 10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometalhc group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups. 
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4. The method of claim 3 wherein R; and Rg are hydrogen. 

5. The method of claim 3 wherein R,-R 10 are independently hydrogen, hydroxyl, alkyl, 
aryl, arylaklyl, or hydroxyalkyl. 

6. The method of claim 3 wherein R r R 10 are independently hydrogen, hydroxyl, methyl, 
ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, tolyl, hydroxymethyl, hydroxypropyl, or 
hydroxybutyl. 

7. The method of claim 3 wherein said anthracene is selected from the group consisting 
of 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 9,10- 
diphenylanthracene, 9,10-dihydroxymethylanthracene, 9-hydroxymethyl-10- 
methylanthracene, dimethylanthracene-l,2-diol, 9-hydroxymethyl-10-methylanthracene-l,2- 
diol, 9-hydroxymethyl-10-methylanthracene-3,4-diol, and 9, 10-di-m-tolyanthracene. 

8. The method of claim 3 wherein R 3 , R,, R 5 , Rg, R 75 R 8 , R, and R I0 are hydrogen. 

9. The method of claim 3 wherein R t , R^ R 3 , R 4 , Rg, R 6 , R 7 and R* are hydrogen. 

10. The method of claim 3 wherein R I5 R^ R 3 , R 4 , R5, R 6 , R 7 and R 8 are hydrogen. 

1 1 . The method of claim 3 wherein Rj, R 2 , R 3 , R 4 , Rj, R 5 , R, and R 10 are hydrogen. 

12. The method of claim 3 wherein R l5 R^ R 3 , R 4 , Rj, Rg, R 7 and R g are hydrogen. 

13. The method of claim 3 wherein R l9 R,, R 3 , R,, R^ R,., R 7 R 8 and R 10 are hydrogen. 

14. The method of claim 1 wherein said ATPase inhibitor is nonhydrolyzable forms of 
ATP such as AMP-PNP. 


15. The method of claim 1 wherein said a nuclease inhibitor is an analog of N- 
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Ethylmaleimide, aheterodimeric adenine-chain-acridine compounds, or aquinilone such 
as Heliquinomycin. 

1 6. The method of claim 1 wherein said polymerase inhibitor is an analog of 
apmflcohXl-(2'-Deoxy-2'-fluoro-beta-L^ 
2 , ,3'-dideoxyribonucleoside5 , -triphosphates. 

17. The method of claim 1 wherein said antisense oligonucleotide comprises about 15 
consecutive nucleotides that are complementary to the coding strand of a mismatch repair 
protein, wherein said antisense oligonucleotide specifically binds to said coding strand of 
said mismatch repair protein under physiological conditions and inhibits mismatch repair 
activity of said mismatch repair protein. 

1 8. The method of claim 17 wherein said antisense oligonucleotide specifically binds 
to a regulatory portion on said coding strand of said mismatch repair protein. 

19. The method of claim 1 7 wherein said antisense oligonucleotide is directed against 
the first six codons of a MMR gene message. 

20. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a eukaryotic cell in vitro. 

21. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a prokaryotic cell in vitro. 

22. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a plant. 

23 . A method for generating a mutation in a gene of interest comprising exposing a cell 
comprising said gene of interest to a chemical mismatch repair inhibitor and testing said 
cell to determine whether said gene of interest comprises a mutation. 


WO 02/054856 


51 


PCT/US01/00934 


24. The method of claim 23 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

25 . The method of claim 23 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 

26. The method of claim 23 wherein said testing comprises analyzing the phenotype of 
said cell. 

27. The method of claim 23 wherein said cell is a mammalian cell, and wherein said 
mammalian cell is made mismatch repair defective by exposing said mammalian cell to an 
inhibitor of mismatch repair. 

28 . The method of claim 27 further comprising removing the chemical inhibitor of 
mismatch repair after determining that said gene of interest comprises a mutation. 

29. The method of claim 27 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

3 0. The method of claim 27 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 

3 1 . The method of claim 27 wherein said testing comprises analyzing the phenotype of 
said cell. 

32. A method for generating a mutation in a gene of interest comprising exposing an 
animal to a chemical inhibitor of mismatch repair and testing said animal to determine 
whether the gene of interest comprises a mutation. 


3 3 . The method of claim 32 wherein said animal is a mammal. 
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34. The method of claim 32 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

35. The method of claim 32 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 

36: The method of claim 32 wherein said testing comprises analyzing the phenotype of 
said cell. 

37. The method of claim 33 wherein said mammal is made mismatch repair defective 
by exposing said mammal to an inhibitor of mismatch repair. 

38. The method of claim 37 further comprising removing said inhibitor of mismatch 
repair after deterrnining that said gene of interest comprises a mutation. 

39. A hypermutable transgenic mammal made by the method of claim 33. 

40. A method for generating a mismatch repair defective plant comprising exposing 
said plant to an inhibitor of mismatch repair. 

41 . A method for generating a mutation in a gene of interest comprising growing a 
plant comprising said gene of interest, exposing said plant to an inhibitor of mismatch 
repair, and testing said plant to determine whether said gene of interest comprises a 
mutation. 

42. The method of claim 41 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

43 . The method of claim 41 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 


44. The method of claim 41 wherein said testing comprises analyzing the phenotype of 
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said plant. 

45. The method of claim 41 wherein said plant is made mismatch repair defective by 
exposing said plant to an inhibitor of mismatch repair. 

46. A hypermutable plant made by the method of claim 40. 

47. The plant of claim 46 wherein said plant is monocot. 

48. The plant of claim 46 wherein said plant is dicot. 

49. A method for screening for chemical inhibitors of mismatch repair comprising 
exposing an organism to a candidate compound and screening the DNA of said organism 
for microsatellite instabihty. 

50. The method of claim 49 wherein said organism is a mammal. 

5 1 . The method of claim 49 wherein said organism is a microbe. 

52. The method of claim 49 wherein said organism is a plant. 

53. The method of claim 49 wherein said screening comprises monitoring endogenous 
microsatellites. 

54. The method of claim 49 wherein said screening comprises the use of reporter 
expression genes, wherein said reporter expression genes comprise polynucleotide repeats 
within a coding region of said reporter gene. 

55. The method of claim 54 wherein said reporter gene is P-glucuronidase. 

56. A method for blocking mismatch repair activity in vivo comprising exposing a cell 
to an anthracene compound. 
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57. The method of claim 56 wherein said anthracene comprises the formula: 


wherein R,-Ri 0 are independently hydrogen, hydroxy!, amino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, N0 2 , an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, N0 2 , lower alkyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups. 

5 8 . The method of claim 57 wherein R^ and R s are hydrogen. 

59. The method of claim 57 wherein RrR 10 are independently hydrogen, hydroxyl, alkyl, 
aryl, arylaklyl, orhydroxyalkyl. 
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60. The method of claim 57 wherein R t -R 10 are independently hydrogen, hydroxyl, methyl, 
ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, tolyl, hydroxymethyl, hydroxypropyl, or 
hydroxybutyl. 

6 1 . The method of claim 57 wherein said anthracene is selected from the group consisting 
of 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 9,10- 
diphenylanthracene, 9,10-dihydroxymethylanthracene, 9-hydroxymethyl-10- 
memylanthracene, dimemylant^^^ 

diol, 9-hydroxymethyl-10-methylanthracene-3,4-diol, and 9, 10-di-m-tolyanthracene. 
Rj> R<u 

62. The method of claim 57 wherein R 3 , R 4 , Rj, R 6 , R 7 , R 8 , R, and R 10 are hydrogen. 

63 . The method of claim 57 wherein R„ R,, R 3 , R,, R;, R,, R 7 and R 8 are hydrogen. 

64. The method of claim 57 wherein R l5 R^ R 3 , Rj, Rg, R 7 and R 8 are hydrogen. 

65. The method of claim 57 wherein R I9 R,, R 3 , R 4 , R,, R^, R, and R 10 are hydrogen. 

66. The method of claim 57 wherein R„ R^ R 3 , R 4 , R 5 , Rj, R 7 and R 8 are hydrogen. 

67. The method of claim 57 wherein R l5 R 2 , R 3 , R,, R;, Rg, R 7 R 8 and R !0 are hydrogen. 

68. The method of claim 23 further comprising exposing said cell to a mutagen. 

69. The method of claim 32 further comprising exposing said animal to a mutagen. 

70. The method of claim 68 or 69 wherein said mutagen is selected from the group 
consisting of N-memyl-N^mfro-N-mtrosoguamdine, methane sulfonate, dimethyl 
sulfonate, O-6-methyl benzadine, ethyl methanesulfonate, methylnitrosourea, and 
ethylnitrosourea. 
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71 . The method of claim 49 wherein the chemcial is a MMR inhibitor wherein it 
induces microsatellite instability in MMR proficient cells but does not induce enhanced 
microsattelite instability in MMR deficient cells. 
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<110> Nicolaides, Nicholas C 
Grasso, Luigi 
Sass, Philip M 

<120> CHEMICAL INHIBITORS OF MISMATCH REPAIR 

<130> MOR-0018 

<140> 00/000,000 
<141> 2001-01-15 

<160> 44 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 1 

ggatcctaat acgactcact atagggagac caccatgtcg ttcgtggcag gg 52 

<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
primer 

<400> 2 

taagtcttaa gtgctaccaa c 21 

<210> 3 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 3 

ggatcctaat acgactcact atagggagac caccatggaa caattgcctg egg 53 

<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 4 

aggttagtga agactctgtc 20 

<210> 5 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 5 

ctgatctcac ggacaatagt gc 22 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 6 

ggctccataa aaagtgcacc 20 
<210> 7 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 7 

ggtctgttga tgtcgtaagt eg 22 

<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 8 

atcttgaaac ctttagggag gg 22 

<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 9 

agaagtttag acaggtac 18 

<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.oligonucleotide 
primer 

<400> 10 

3 
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aaatgtgcaa ttgccttc 

<210> 11 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 11 

cccggatcca tgttaaaaaa aaaaaaaaaa aaaaaacgtc ctgtagaaac c 51 

<210> 12 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 12 

cccggatcca tgttaaaaaa aaaaaaaaaa aaaacgtcct gtagaaac 48 

<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
primer 

<400> 13 

cccgaattcc ccgatctagt aacatagatg 30 


<210> 14 

<211> 859 

<212> PRT 

<213> Mus musculus 

<400> 14 
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18 
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Met Glu Gin Thr Glu Gly Val Ser Thr Glu Cys Ala Lys Ala lie Lys 
15 10 15 

Pro He Asp Gly Lys Ser Val His Gin He Cys Ser Gly Gin Val He 
20 25 30 

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu He Glu Asn Ser Val Asp 


Leu He Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe 
65 70 75 80 

Glu Gly Leu Ala Leu Lys His His Thr Ser Lys He Gin Glu Phe Ala 
85 90 95 

Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser 
100 , 105 110 

Ser Leu Cys Ala Leu Ser Asp Val Thr He Ser Thr Cys His Gly Ser 
115 120 125 

Ala Ser Val Gly Thr Arg Leu Val Phe Asp His Asn Gly Lys He Thr 
130 135 140 

Gin Lys Thr Pro Tyr Pro Arg Pro Lys Gly Thr Thr Val Ser Val Gin 
145 . 150 155 160 

His Leu Phe Tyr Thr Leu Pro Val Arg Tyr Lys Glu Phe Gin Arg Asn 
165 170 175 

He Lys Lys Glu Tyr Ser Lys Met Val Gin Val Leu Gin Ala Tyr Cys 
180 185 190 

He He Ser Ala Gly Val Arg Val Ser Cys Thr Asn Gin Leu Gly Gin 
195 200 205 

Gly Lys Arg His Ala Val Val Cys Thr Ser Gly Thr Ser Gly Met Lys 
210 215 220 

Glu Asn He Gly Ser Val Phe Gly Gin Lys Gin Leu Gin Ser Leu He 
225 230 . 235 240 

Pro Phe Val Gin Leu Pro Pro Ser Asp Ala Val Cys Glu Glu Tyr Gly 


35 


40 


45 


Ala Gly Ala Thr Thr He Asp Leu 
50 55 


Arg Leu Lys Asp Tyr Gly Val Asp 
60 


245 


250 


255 
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Leu Ser Thr Ser Gly Arg His Lys Thr Phe Ser Thr Phe Arg Ala Ser 
260 265 270 

Phe His Ser Ala Arg Thr Ala Pro Gly Gly Val Gin Gin Thr Gly Ser 
275 290 285 

Phe Ser Ser Ser He Arg Gly Pro Val Thr Gin Gin Arg Ser Leu Ser 
290 295 300 

Leu Ser Met Arg Phe Tyr His Met Tyr Asn Arg His Gin Tyr Pro Phe 
305 310 315 320 

Val Val Leu Asn Val Ser Val Asp Ser Glu Cys Val Asp He Asn Val 
325 330 335 

Thr Pro Asp Lys Arg Gin He Leu Leu Gin Glu Glu Lys Leu Leu Leu 
340 345 350 

Ala Val Leu Lys Thr Ser Leu He Gly Met Phe Asp Ser Asp Ala Asn 
355 360 365 

Lys Leu Asn Val Asn Gin Gin Pro Leu Leu Asp Val Glu Gly Asn Leu 
370 375 380 

Val Lys Leu His Thr Ala Glu Leu Glu Lys Pro Val Pro Gly Lys Gin 
385 390 395 400 

Asp Asn Ser Pro Ser Leu Lys Ser Thr Ala Asp Glu Lys Arg Val Ala 
405 410 415 

Ser He Ser Arg Leu Arg Glu Ala Phe Ser Leu His Pro Thr Lys Glu 
420 425 430 

He Lys Ser Arg Gly Pro Glu Thr Ala Glu Leu Thr Arg Ser Phe Pro 
435 440 445 

Ser Glu Lys Arg Gly Val Leu Ser Ser Tyr Pro Ser Asp Val He Ser 
450 455 460 

Tyr Arg Gly Leu Arg Gly Ser Gin Asp Lys Leu Val Ser Pro Thr Asp 
465 470 475 480 

Ser Pro Gly Asp Cys Met Asp Arg Glu Lys He Glu Lys Asp Ser Gly 


Leu Ser Ser Thr Ser Ala Gly Ser Glu Glu Glu Phe Ser Thr Pro Glu 


485 


490 


4 95 


500 


505 


510 
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Val Ala Ser Ser Phe Ser Ser 'Asp Tyr Asn Val Ser Ser Leu Glu Asp 
515 520 525 

Arg Pro Ser Gin "Glu Thr He Asn Cys Gly Asp Leu Asp Cys Arg Pro 
530 535 540 

Pro Gly Thr Gly Gin Ser Leu Lys Pro Glu Asp His Gly Tyr Gin Cys 
545 550 555 560 

Lys Ala Leu Pro Leu Ala Arg Leu Ser Pro Thr Asn Ala Lys Arg Phe 
565 570 575 

Lys Thr Glu Glu Arg Pro Ser Asn Val Asn He Ser Gin Arg Leu Pro 
580 585 590 

Gly Pro Gin Ser Thr Ser Ala Ala Glu Val Asp Val Ala He Lys Met 
595 600 605 

Asn Lys Arg He Val Leu Leu Glu Phe Ser Leu Ser Ser Leu Ala Lys 
610 615 620 

Arg Met Lys Gin Leu Gin His Leu Lys Ala Gin Asn Lys His Glu Leu 
625 630 635 640 

Ser Tyr Arg Lys Phe Arg Ala Lys He Cys Pro Gly Glu Asn Gin Ala 
645 650 655 

Ala Glu Asp Glu Leu Arg Lys Glu He Ser Lys Ser Met Phe Ala Glu 
660 665 670 

Met Glu He Leu Gly Gin Phe Asn Leu Gly Phe He Val Thr Lys Leu 
675 680 685 

Lys Glu Asp Leu Phe Leu Val Asp Gin His Ala Ala Asp Glu Lys Tyr 
690 695 700 

Asn Phe Glu Met Leu Gin Gin His Thr Val Leu Gin Ala Gin Arg Leu 
705 710 • 715 720 

He Thr Pro Gin Thr Leu Asn Leu Thr Ala Val Asn Glu Ala Val Leu 
725 730 735 

He Glu Asn Leu Glu He Phe Arg Lys Asn Gly Phe Asp Phe Val He 
740 745 750 

Asp Glu Asp Ala Pro Val Thr Glu Arg Ala Lys Leu He Ser Leu Pro 
755 760 765 
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Thr Ser Lys Asn Trp Thr Phe Gly Pro Gin Asp He Asp Glu Leu He 
770 775 780 

Phe Met Leu Ser Asp Ser Pro Gly Val Met Cys Arg Pro Ser Arg Val 

785 790 795 800 

Arg Gin Met Phe Ala Ser Arg Ala Cys Arg Lys Ser Val Met He Gly 
805 810 815 

Thr Ala Leu Asn Ala Ser Glu Met Lys Lys Leu He Thr His Met Gly 
820 825 830 

Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro Thr Met Arg 
835 840 845 

His Val Ala Asn Leu Asp Val He Ser Gin Asn 
850 855 


<210> 15 
<211> 3056 
<212> DNA 

<213> Mus musculus 
<400> 15 

gaattccggt gaaggtcctg aagaatttcc 
taacctgtcg tcaggtaacg atggtgtata 
gtcttttccc gagagcggca ccgcaactct 
catccatgga gcaaaccgaa ggcgtgagta 
atgggaagtc agtccatcaa atttgttctg 
tgaaggagtt gatagaaaat agtgtagatg 
aagactatgg ggtggacctc attgaagttt 
actttgaagg tctagctctg aaacatcaca 
cgcaggttga aactttcggc tttcgggggg 
atgtcactat atctacctgc cacgggtctg 
ataatgggaa aatcacccag aaaactccct 
tgcagcactt attttataca ctacccgtgc 
aggagtattc caaaatggtg caggtcttac 
gtgtaagctg cactaatcag ctcggacagg 
gcacgtctgg catgaaggaa aatatcgggt 
tcattccttt tgttcagctg ccccctagtg 
cttcaggacg ccacaaaacc ttttctacgt 
cgccgggagg agtgcaacag acaggcagtt 
agcaaaggtc tctaagcttg tcaatgaggt 
catttgtcgt ccttaacgtt tccgttgact 
ataaaaggca aattctacta caagaagaga 
tgataggaat gtttgacagt gatgcaaaca 
atgttgaagg taacttagta aagctgcata 


agattcctga gtatcattgg aggagacaga 60 
tgcaacagaa atgggtgttc ctggagacgc 120 
cccgcggtga ctgtgactgg aggagtcctg 180 
cagaatgtgc taaggccatc aagcctattg 240 
ggcaggtgat actcagttta agcaccgctg 300 
ctggtgctac tactattgat ctaaggctta 360 
cagacaatgg atgtggggta gaagaagaaa 420 
catctaagat tcaagagttt gccgacctca 480 
aagctctgag ctctctgtgt gcactaagtg 540 
caagcgttgg gactcgactg gtgtttgacc 600 
acccccgacc taaaggaacc acagtcagtg 660 
gttacaaaga gtttcagagg aacattaaaa 720 
aggcgtactg tatcatctca gcaggcgtcc 780 
ggaagcggca cgctgtggtg tgcacaagcg 840 
ctgtgtttgg ccagaagcag ttgcaaagcc 900 
acgctgtgtg tgaagagtac ggcctgagca 960 
ttcgggcttc atttcacagt gcacgcacgg 1020 
tttcttcatc aatcagaggc cctgtgaccc 1080 
tttatcacat gtataaccgg catcagtacc 1140 
cagaatgtgt ggatattaat gtaactccag 1200 
agctattgct ggccgtttta aagacctcct 1260 
agcttaatgt caaccagcag ccactgctag 1320 
ctgcagaact agaaaagcct gtgccaggaa 1380 
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agcaagataa ctctccttca ctgaagagca 
ccaggctgag agaggccttt tctcttcatc 
agactgctga actgacacgg agttttccaa 
cttcagacgt catctcttac agaggcctcc 
oggacagccc tggtgactgt atggacagag 
gcacctcagc tggctctgag gaagagttca 
gtgactataa cgtgagctcc ctagaagaca 
acctggactg ccgtcctcca ggtacaggac 
aatgcaaagc tctacctcta gctcgtctgt 
aggaaagacc ctcaaatgtc aacatttctc 
cagctgaggt cgatgtagcc ataaaaatga 
tgagttctct agctaagcga atgaagcagt 
aactgagtta cagaaaattt agggccaaga 
atgaactcag aaaagagatt agtaaatcga 
ttaacctggg atttatagta accaaactga 
ctgcggatga gaagtacaac tttgagatgc 
ggctcatcac accccagact ctgaacttaa 
atctggaaat attcagaaag aatggctttg 
ctgaaagggc taaattgatt tccttaccaa 
atatagatga actgatcttt atgttaagtg 
gagtcagaca gatgtttgct tccagagcct 
tcaatgcgag cgagatgaag aagctcatca 
actgccccca cggcaggcca accatgaggc 
actgacacac cccttgtagc atagagttta 
ttttaagtaa tctgattatc gttgtacaaa 
catttaaaag cagtgttaag gcaggcatga 
tgatccggtg ggagctcatg tgagcccagg 
agactcaatt caaggacaaa aaaaaaaaga 


cagcagacga gaaaagggta gcatccatct 1440 

ctactaaaga gatcaagtct aggggtccag 1500 

gtgagaaaag gggcgtgtta tcctcttatc 1560 

gtggctcgca ggacaaattg gtgagtccca 1620 

agaaaataga aaaagactca gggctcagca 1680 

gcaccccaga agtggccagt agctttagca 1740 

gaccttctca ggaaaccata aactgtggtg 1800 

agtccttgaa gccagaagac catggatatc 1860 

cacccacaaa tgccaagcgc ttcaagacag 1920 

aaagattgcc tggtcctcag agcacctcag 1980 

ataagagaat cgtgctcctc gagttctctc 2040 

tacagcacct aaaggcgcag aacaaacatg 2100 

tttgccctgg agaaaaccaa gcagcagaag 2160 

tgtttgcaga gatggagatc ttgggtcagt 2220 

aagaggacct cttcctggtg gaccagcatg 2280 

tgcagcagca cacggtgctc caggcgcaga 2340 

ctgctgtcaa tgaagctgta ctgatagaaa 2400 

actttgtcat tgatgaggat gctccagtca 24 60 

ctagtaaaaa ctggaccttt ggaccccaag 2520 

acagccctgg ggtcatgtgc cggccctcac 2580 

gtcggaagtc agtgatgatt ggaacggcgc 2640 

cccacatggg tgagatggac cacccctgga 2700 

acgttgccaa tctggatgtc atctctcaga 27 60 

ttacagattg ttcggtttgc aaagagaagg 2820 

aattagcatg ctgctttaat gtactggatc 2880 

tggagtgttc ctctagctca gctacttggg 294 0 

actttgagac cactccgagc cacattcatg 3000 

tatttttgaa gccttttaaa aaaaaa 3056 


<210> 16 

<211> 862 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Glu Arg Ala Glu Ser Ser Ser Thr Glu Pro Ala Lys Ala He Lys 
15 10 15 

Pro He Asp Arg Lys Ser Val His Gin He Cys Ser Gly Gin Val Val 
20 25 30 

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp 
35 40 45 


Ala Gly Ala Thr Asn He Asp Leu Lys Leu Lys Asp Tyr Gly Val Asp 
50 55 60 


WO 02/054856 


PCT/US01/00934 


Leu lie Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe 
65 70 75 80 

Glu Gly Leu Thr Leu Lys His His Thr Ser Lys He Gin Glu Phe Ala 
85 90 95 

Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser 
100 105 HO 

Ser Leu Cys Ala Leu Ser Asp Val Thr He Ser Thr Cys His Ala Ser 
115 120 125 

Ala Lys Val Gly Thr Arg Leu Met Phe Asp His Asn Gly Lys He He 
130 135 140 

Gin Lys Thr Pro Tyr Pro Arg Pro Arg Gly Thr Thr Val Ser Val Gin 
145 150 155 160 

Gin Leu Phe Ser Thr Leu Pro Val Arg His Lys Glu Phe Gin Arg Asn 
165 170 175 

He Lys Lys Glu Tyr Ala Lys Met Val Gin Val Leu His Ala Tyr Cys 
180 185 190 

He He Ser Ala Gly He Arg Val Ser Cys Thr Asn Gin Leu Gly Gin 
195 200 205 

Gly Lys Arg Gin Pro Val Val Cys Thr Gly Gly Ser Pro Ser He Lys 
210 215 220 

Glu Asn He Gly Ser Val Phe Gly Gin Lys Gin Leu Gin Ser Leu He 
225 230 235 240 

Pro Phe Val Gin Leu Pro Pro Ser. Asp Ser Val Cys Glu Glu Tyr Gly 
245 250 255 

Leu Ser Cys Ser Asp Ala Leu His Asn Leu Phe Tyr He Ser Gly Phe 
260 265 270 

He Ser Gin Cys Thr His Gly Val Gly Arg Ser Ser Thr Asp Arg Gin 
275 280 285 

Phe Phe Phe He Asn Arg Arg Pro Cys Asp Pro Ala Lys Val Cys Arg 
290 295 300 

Leu Val Asn Glu Val Tyr His Met Tyr Asn Arg His Gin Tyr Pro Phe 
305 310 315 320 
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Val Val Leu Asn He Ser Val Asp Ser Glu Cys Val Asp He Asn Val 
325 330 335 


Thr Pro Asp Lys Arg Gin He Leu Leu Gin Glu Glu Lys Leu Leu Leu 
340 345 350 


Ala Val Leu Lys Thr Ser Leu He Gly Met Phe Asp Ser Asp Val Asn 
355 360 365 


Lys Leu Asn Val Ser Gin Gin Pro Leu Leu Asp Val Glu Gly Asn Leu 
370 375 380 


He Lys Met His Ala Ala Asp Leu Glu Lys Pro Met Val Glu Lys Gin 
385 390 395 400 


Asp Gin Ser Pro Ser Leu Arg Thr Gly Glu Glu Lys Lys Asp Val Ser 
405 410 415 


He Ser Arg Leu Arg Glu Ala Phe Ser Leu Arg His Thr Thr Glu Asn 
420 425 430 


Lys Pro His Ser Pro Lys Thr Pro Glu Pro Arg Arg Ser Pro Leu Gly 
435 440 445 


Gin Lys Arg Gly Met Leu Ser Ser Ser Thr Ser Gly Ala He Ser. Asp 
450 455 460 


Lys Gly Val Leu Arg Pro Gin Lys Glu Ala Val Ser Ser Ser His Gly 
465 470 475 480 


Pro Ser Asp Pro Thr Asp Arg Ala Glu Val Glu Lys Asp Ser Gly His 
485 490 495 


Gly Ser Thr Ser Val Asp Ser Glu Gly Phe Ser He Pro Asp Thr Gly 
500 505 510 


Ser His Cys Ser Ser Glu Tyr Ala Ala Ser Ser Pro Gly Asp Arg Gly 
515 520 525 


Ser Gin Glu His Val Asp Ser Gin Glu Lys Ala Pro Glu Thr Asp Asp 
530 535 ' 540 


Ser Phe Ser Asp Val Asp Cys His Ser Asn Gin Glu Asp Thr Gly Cys 
545 550 555 560 


Lys Phe Arg Val Leu Pro Gin Pro Thr Asn Leu Ala Thr Pro Asn Thr 
565 570 575 
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Lys Arg Phe Lys Lys Glu Glu lie Leu Ser Ser Ser Asp He Cys Gin 

580 585 590 

Lys Leu Val Asn Thr Gin Asp Met Ser Ala Ser Gin Val Asp Val Ala 

595 600 605 

Val Lys He Asn Lys Lys Val Val Pro Leu Asp Phe Ser Met Ser Ser 

610 615 620 

Leu Ala Lys Arg He Lys Gin Leu His His Glu Ala Gin Gin Ser Glu 

625 630 635 640 

Gly Glu Gin Asn Tyr Arg Lys Phe Arg Ala Lys He Cys Pro Gly Glu 

645 650 655 

Asn Gin Ala Ala Glu Asp Glu Leu Arg Lys Glu He Ser Lys Thr Met 

660 . 665 670 

Phe Ala Glu Met Glu He He Gly Gin Phe Asn Leu Gly Phe He He 

675 680 685 

Thr Lys Leu Asn Glu Asp He Phe He Val Asp Gin His Ala Thr Asp 

690 695 700 

Glu Lys Tyr Asn Phe Glu Met Leu Gin Gin His Thr Val Leu Gin Gly 

705 710 715 720 

Gin Arg Leu He Ala Pro Gin Thr Leu Asn Leu Thr Ala Val Asn Glu 


Ala Val Leu He Glu Asn Leu Glu He Phe Arg Lys Asn Gly Phe Asp 
740 745 750 

Phe Val He Asp Glu Asn Ala Pro Val Thr Glu Arg Ala Lys Leu He 
755 760 765 

Ser Leu Pro Thr Ser Lys Asn Trp Thr Phe Gly Pro Gin Asp Val Asp 
770 775 780 

Glu Leu He Phe Met Leu Ser Asp Ser Pro Gly Val Met Cys Arg Pro 
785 790 795 800 

Ser Arg Val Lys Gin Met Phe Ala Ser Arg Ala Cys Arg Lys Ser Val 
805 810 815 

Met lie Gly Thr Ala Leu Asn Thr Ser Glu Met Lys Lys Leu He Thr 


725 


730 


735 


820 


825 


830 
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His Met Gly Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro 

835 840 845 

Thr Met Arg His He Ala Asn Leu Gly Val He Ser Gin Asn 
850 855 860 


<210> 17 
<211> 2771 
<212> DNA 

<213> Homo sapiens 
<400> 17 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 480 
agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 
tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 
atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 
cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 
cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780 
gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840 
atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 900 
aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960 
tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 
gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080 
gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 
agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200 
gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1260 
aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 
aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1380 
atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1440 
gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500 
gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560 
agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1620 
gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1680 
tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740 
accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 
aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 18 60 
aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920 
catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 
tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 
tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100 
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gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2160 
cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220 
gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2280 
tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340 
agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400 
agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2460 
cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520 
cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580 
atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2640 
tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700 
atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2760 
cttttcaaac c 2771 


<210> 18 

<211> 932 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 
15 10 15 

lie He Thr Ser Val Val Ser Val Val Lys Glu Leu He Glu Asn Ser 
20 25 30 

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 
35 40 45 

Phe Asp Lys He Glu Val Arg Asp Asn Gly Glu Gly He Lys Ala Val 
50 55 60 

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys He Asn Ser 
65 70 75 80 

His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala 
85 90 95 

Leu Gly Ser He Cys Cys He Ala Glu Val Leu He Thr Thr Arg Thr 
100 . 105 110 

Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 

He Leu Ser Gin Lys Pro Ser His Leu Gly Gin Gly Thr Thr Val Thr 
130 135 140. 

Ala Leu Arg Leu Phe Lys Asn Leu Pro Val Arg Lys Gin Phe Tyr Ser 
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160 


Thr Ala Lys Lys Cys Lys Asp Glu He Lys Lys He Gin Asp Leu Leu 
165 170 175 

Met Ser Phe Gly He Leu Lys Pro Asp Leu Arg He Val Phe Val His 
180 185 190 

Asn Lys Ala Val He Trp Gin Lys Ser Arg Val Ser Asp His Lys Met 
195 200 205 

Ala Leu Met Ser Val Leu Gly Thr Ala Val Met Asn Asn Met Glu Ser 
210 215 220 

Phe Gin Tyr His Ser Glu Glu Ser Gin He Tyr Leu Ser Gly Phe Leu 
225 230 235 240 

Pro Lys Cys Asp Ala Asp His Ser Phe Thr Ser Leu Ser Thr Pro Glu 
245 250 255 

Arg Ser Phe He Phe He Asn Ser Arg Pro Val His Gin Lys Asp He 
260 265 270 

Leu Lys Leu He Arg His His Tyr Asn Leu Lys Cys Leu Lys Glu Ser 
275 280 285 

Thr Arg Leu Tyr Pro Val Phe Phe Leu Lys He Asp Val Pro Thr Ala 
290 295 300 

Asp Val Asp Val Asn Leu Thr Pro Asp Lys Ser Gin Val Leu Leu Gin 
305 310 315 320 

Asn Lys Glu Ser Val Leu He Ala Leu Glu Asn Leu Met Thr Thr Cys 
325 330 335 

Tyr Gly Pro Leu Pro Ser Thr Asn Ser Tyr Glu Asn Asn Lys Thr Asp 
340 345 350 

Val Ser Ala Ala Asp He Val Leu Ser Lys Thr Ala Glu Thr Asp Val 
355 360 365 

Leu Phe Asn Lys Val Glu Ser Ser Gly Lys Asn Tyr Ser Asn Val Asp 
370 375 380 

Thr Ser Val He Pro Phe Gin Asn Asp Met His Asn Asp Glu Ser Gly 
385 390 395 400 

Lys Asn Thr Asp Asp Cys Leu Asn His Gin He Ser He Gly Asp Phe 
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415 


Gly Tyr Gly His Cys Ser Ser Glu He Ser Asn He Asp Lys Asn Thr 
420 425 430 

Lys Asn Ala Phe Gin Asp He Ser Met Ser Asn Val Ser Trp Glu Asn 
435 440 445 

Ser Gin Thr Glu Tyr Ser Lys Thr Cys Phe He Ser Ser Val Lys His 
450 455 460 

Thr Gin Ser Glu Asn Gly Asn Lys Asp His He Asp Glu Ser Gly Glu 
465 470 475 480 

Asn Glu Glu Glu Ala Gly Leu Glu Asn Ser Ser Glu He Ser Ala Asp 
485 490 495 

Glu Trp Ser Arg Gly Asn He Leu Lys Asn Ser Val Gly Glu Asn He 
500 505 510 

Glu Pro Val Lys He Leu Val Pro Glu Lys Ser Leu Pro Cys Lys Val 
515 520 525 

Ser Asn Asn Asn Tyr Pro He Pro Glu Gin Met Asn Leu Asn Glu Asp 
530 535 540 

Ser Cys Asn Lys Lys Ser Asn Val He Asp Asn Lys Ser Gly Lys Val 
545 550 555 560 

Thr Ala Tyr Asp Leu Leu Ser Asn Arg Val He Lys Lys Pro Met Ser 
565 570 575 

Ala Ser Ala Leu Phe Val Gin Asp His Arg Pro Gin Phe Leu He Glu 
580 585 590 

Asn Pro Lys Thr Ser Leu Glu Asp Ala Thr Leu Gin He Glu Glu Leu 
595 600 . 605 

Trp Lys Thr Leu Ser Glu Glu Glu Lys Leu Lys Tyr Glu Glu Lys Ala 
610 615 620 

Thr Lys Asp Leu Glu Arg Tyr Asn Ser Gin Met Lys Arg Ala He Glu 
625 630 635 640 

Gin Glu Ser Gin Met Ser Leu Lys Asp Gly Arg Lys Lys He Lys Pro 
645 650 655 

Thr Ser Ala Trp Asn Leu Ala Gin Lys His Lys Leu Lys Thr Ser Leu 
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Ser Asn Gin Pro Lys Leu Asp Glu Leu Leu Gin Ser Gin lie Glu Lys 
675 680 685 


Arg Arg Ser Gin Asn He Lys Met Val Gin He Pro Phe Ser Met Lys 
690 695 700 


Asn Leu Lys He Asn Phe Lys Lys Gin Asn Lys Val Asp Leu Glu Glu 
705 710 715 720 


Lys Asp Glu Pro Cys Leu He His Asn Leu Arg Phe Pro Asp Ala Trp 
725 730 735 


Leu Met Thr Ser Lys Thr Glu Val Met Leu Leu Asn Pro Tyr Arg Val 
740 745 750 


Glu Glu Ala Leu Leu Phe Lys Arg Leu Leu Glu Asn His Lys Leu Pro 
755 760 765 


Ala Glu Pro Leu Glu Lys Pro He Met Leu Thr Glu Ser Leu Phe Asn 
770 775 780 


Gly Ser His Tyr Leu Asp Val Leu Tyr Lys Met Thr Ala Asp Asp Gin 
785 790 795 800 


Arg Tyr Ser Gly Ser Thr Tyr Leu Ser Asp Pro Arg Leu Thr Ala Asn 
805 810 815 


Gly Phe Lys He Lys Leu He Pro Gly Val Ser lie Thr Glu Asn Tyr 
820 825 830 


Leu Glu He Glu Gly Met Ala Asn Cys Leu Pro Phe Tyr Gly Val Ala 
835 840 845 


Asp Leu Lys Glu He Leu Asn Ala He Leu Asn Arg Asn Ala Lys Glu 
850 855 860 


Val Tyr Glu Cys Arg Pro Arg Lys Val He Ser Tyr Leu Glu Gly Glu 
865 870 875 880 


Ala Val Arg Leu Ser Arg Gin Leu Pro Met Tyr Leu Ser Lys Glu Asp 
885 890 895 


He Gin Asp He He Tyr Arg Met Lys His Gin Phe Gly Asn Glu He 
900 905 910 


Lys Glu Cys Val His Gly Arg Pro Phe Phe His His Leu Thr Tyr Leu 
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Pro Glu Thr Thr 
930 


<210> 19 

<211> 3063 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ggcacgagtg gctgcttgcg gctagtggat 
ctgctctgtt aaaagcgaaa atgaaacaat 
gttctcagat catcacttcg gtggtcagtg 
atgctggtgc cacaagcgta gatgttaaac 
tgcgagataa cggggagggt atcaaggctg 
acacctcaaa aataaatagt catgaagatc 
gagaagcctt ggggtcaatt tgttgtatag 
ctgataattt tagcacccag tatgttttag 
cttcacatct tggtcaaggt acaactgtaa 
taagaaagca gttttactca actgcaaaaa 
atctcctcat gagctttggt atccttaaac 
aggcagttat ttggcagaaa agcagagtat 
tggggactgc tgttatgaac aatatggaat 
tttatctcag tggatttctt ccaaagtgtg 
caccagaaag aagtttcatc ttcataaaca 
agttaatccg acatcattac aatctgaaat 
ttttctttct gaaaatcgat gttcctacag 
aaagccaagt attattacaa aataaggaat 
cgacttgtta tggaccatta cctagtacaa 
ccgcagctga catcgttctt agtaaaacag 
aatcatctgg aaagaattat tcaaatgttg 
tgcataatga tgaatctgga aaaaacactg 
gtgactttgg ttatggtcat tgtagtagtg 
atgcatttca ggacatttca atgagtaatg 
gtaaaacttg ttttataagt tccgttaagc 
atatagatga gagtggggaa aatgaggaag 
ctgcagatga gtggagcagg ggaaatatac 
ctgtgaaaat tttagtgcct gaaaaaagtt 
caatccctga acaaatgaat cttaatgaag 
ataataaatc tggaaaagtt acagcttatg 
ccatgtcagc aagtgctctt tttgttcaag 
ctaagactag tttagaggat gcaacactac 
aagaggaaaa actgaaatat gaagagaagg 
aaatgaagag agccattgaa caggagtcac 
taaaacccac cagcgcatgg aatttggccc 
atcaaccaaa acttgatgaa ctccttoagt 


ggtaattgcc tgcctcgcgc tagcagcaag 60 
tgcctgcggc aacagttcga ctcctttcaa 120 
ttgtaaaaga gcttattgaa aactccttgg 180 
tggagaacta tggatttgat aaaattgagg 240 
ttgatgcacc tgtaatggca atgaagtact 300 
ttgaaaattt gacaacttac ggttttcgtg 360 
ctgaggtttt aattacaaca agaacggctg 420 
atggcagtgg ccacatactt tctcagaaac 480 
ctgctttaag attatttaag aatctacctg 540 
aatgtaaaga tgaaataaaa aagatccaag 600 
ctgacttaag gattgtcttt gtacataaca 660 
cagatcacaa gatggctctc atgtcagttc 720 
cctttcagta ccactctgaa gaatctcaga 780 
atgcagacca ctctttcact agtctttcaa 840 
gtcgaccagt acatcaaaaa gatatcttaa 900 
gcctaaagga atctactcgt ttgtatcctg 960 
ctgatgttga tgtaaattta acaccagata 1020 
ctgttttaat tgctcttgaa aatctgatga 1080 
attcttatga aaataataaa acagatgttt 1140 
cagaaacaga tgtgcttttt aataaagtgg 1200 
atacttcagt cattccattc caaaatgata 1260 
atgattgttt aaatcaccag ataagtattg 1320 
aaatttctaa cattgataaa aacactaaga 1380 
tatcatggga gaactctcag acggaatata 1440 
acacccagtc agaaaatggc aataaagacc 1500 
aagcaggtct tgaaaactct tcggaaattt 1560 
ttaaaaattc agtgggagag aatattgaac 1620 
taccatgtaa agtaagtaat aataattatc 1680 
attcatgtaa caaaaaatca aatgtaatag 1740 
atttacttag caatcgagta atcaagaaac 1800 
atcatcgtcc tcagtttctc atagaaaatc 18 60 
aaattgaaga actgtggaag acattgagtg, 1920 
ctactaaaga cttggaacga tacaatagtc 1980 
aaatgtcact aaaagatggc agaaaaaaga 2040 
agaagcacaa gttaaaaacc tcattatcta 2100 
cccaaattga aaaaagaagg agtcaaaata 2160 
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ttaaaatggt acagatcccc ttttctatga 

acaaagttga cttagaagag aaggatgaac 

atgcatggct aatgacatcc aaaacagagg 

aagccctgct atttaaaaga cttcttgaga 

agccaattat gttaacagag agtcttttta 

aaatgacagc agatgaccaa agatacagtg 

cagcgaatgg tttcaagata aaattgatac 

aaatagaagg aatggctaat tgtctcccat 

ttaatgctat attaaacaga aatgcaaagg 

taagttattt agagggagaa gcagtgcgtc 

aagaggacat ccaagacatt atctacagaa 

agtgtgttca tggtcgccca ttttttcatc 

taaatatgtt taagaagatt agttaccatt 

tctggtttta aattatcttt gtattatgtg 

ctgacttgtt tttatattga aaaaagttcc 
aac 


aaaacttaaa aataaatttt aagaaacaaa 2220 
cttgcttgat ccacaatctc aggtttcctg 2280 
taatgttatt aaatccatat agagtagaag 2340 
atcataaact tcctgcagag ccactggaaa 2400 
atggatctca ttatttagac gttttatata 24 60 
gatcaactta cctgtctgat cctcgtctta 2520 
caggagtttc aattactgaa aattacttgg 2580 
tctatggagt agcagattta aaagaaattc 2640 
aagtttatga atgtagacct cgcaaagtga 2700 
tatccagaca attacccatg tacttatcaa 2760 
tgaagcacca gtttggaaat gaaattaaag 2820 
atttaaccta tcttccagaa actacatgat 2880 
gaaattggtt ctgtcataaa acagcatgag 2940 
tcacatggtt attttttaaa tgaggattca 3000 
acgtattgta gaaaacgtaa ataaactaat 3060 
3063 


<210> 20 

<211> 934 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Ala Val Gin Pro Lys Glu Thr Leu Gin Leu Glu Ser Ala Ala Glu 
15 10 15 

Val Gly Phe Val Arg Phe Phe Gin Gly Met Pro Glu Lys Pro Thr Thr 
20 25 30 

Thr Val Arg Leu Phe Asp Arg Gly Asp Phe Tyr Thr Ala His Gly Glu 
35 40 45 

Asp Ala Leu Leu Ala Ala Arg Glu Val Phe Lys Thr Gin Gly Val lie 
50 55 60 

Lys Tyr Met Gly Pro Ala Gly Ala Lys Asn Leu Gin Ser Val Val Leu 
65 70 75 80 

Ser Lys Met Asn Phe Glu Ser Phe Val Lys Asp Leu Leu Leu Val Arg 
85 90 95 

Gin Tyr Arg Val Glu Val Tyr Lys Asn Arg Ala Gly Asn Lys Ala Ser 
100 105 110 

Lys Glu Asn Asp Trp Tyr Leu Ala Tyr Lys Ala Ser Pro Gly Asn Leu 
115 120 125 
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Ser Gin Phe Glu Asp He Leu Phe Gly Asn Asn Asp Met Ser Ala Ser 
130 135 140 

He Gly Val Val Gly Val Lys Met Ser Ala Val Asp Gly Gin Arg Gin 
145 150 155 160 

Val Gly Val Gly Tyr Val Asp Ser He Gin Arg Lys Leu Gly Leu Cys 
165 170 175 

Glu Phe Pro Asp Asn Asp Gin Phe Ser Asn Leu Glu Ala Leu Leu He 
180 185 190 

Gin lie Gly Pro Lys Glu Cys Val Leu Pro Gly Gly Glu Thr Ala Gly 
195 200 205 

Asp Met Gly Lys Leu Arg Gin He He Gin Arg Gly Gly He Leu He 
210 215 220 

Thr Glu Arg Lys Lys Ala Asp Phe Ser Thr Lys Asp He Tyr Gin Asp 
225 230 235 240 

Leu Asn Arg Leu Leu Lys Gly Lys Lys Gly Glu Gin Met Asn Ser Ala 
245 250 255 

Val Leu Pro Glu Met Glu Asn Gin Val Ala Val Ser Ser Leu Ser Ala 
260 265 270 

Val He Lys Phe Leu Glu Leu Leu Ser Asp Asp Ser Asn Phe Gly Gin 
275 280 285 

Phe Glu Leu Thr Thr Phe Asp Phe Ser Gin Tyr Met Lys Leu Asp He 
290 295 300 

Ala Ala Val Arg Ala Leu Asn Leu Phe Gin Gly Ser Val Glu Asp Thr 
305 310 315 320 

Thr Gly Ser Gin Ser Leu Ala Ala Leu Leu Asn Lys Cys Lys Thr Pro 
325 330 335 

Gin Gly Gin Arg Leu Val Asn Gin Trp He Lys Gin Pro Leu Met Asp 
340 345 350 

Lys Asn Arg He Glu Glu Arg Leu Asn Leu Val Glu Ala Phe Val Glu 
355 360 365 

Asp Ala Glu Leu Arg Gin Thr Leu Gin Glu Asp Leu Leu Arg Arg Phe 
370 375 380 
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Pro Asp Leu Asn Arg Leu Ala Lys Lys Phe Gin Arg Gin Ala Ala Asn 
385 390 395 400 

Leu Gin Asp Cys Tyr Arg Leu Tyr Gin Gly lie Asn Gin Leu Pro Asn 
405 410 415 

Val lie Gin Ala Leu Glu Lys His Glu Gly Lys His Gin Lys Leu Leu 
420 425 430 

Leu Ala Val Phe Val Thr Pro Leu Thr Asp Leu Arg Ser Asp Phe Ser 
435 440 445 

Lys Phe Gin Glu Met lie Glu Thr Thr Leu Asp Met Asp Gin Val Glu 
450 455 460 

Asn His Glu Phe Leu Val Lys Pro Ser Phe Asp Pro Asn Leu Ser Glu 
465 470 475 480 

Leu Arg Glu lie Met Asn Asp Leu Glu Lys Lys Met Gin Ser Thr Leu 
485 490 495 

lie Ser Ala Ala Arg Asp Leu Gly Leu Asp Pro Gly Lys Gin lie Lys 
500 505 510 

Leu Asp Ser Ser Ala Gin Phe Gly Tyr Tyr Phe Arg Val Thr Cys Lys 
515 520 525 

Glu Glu Lys Val Leu Arg Asn Asn Lys Asn Phe Ser Thr Val Asp lie 
530 535 540 

Gin Lys Asn Gly Val Lys Phe Thr Asn Ser Lys Leu Thr Ser Leu Asn 
545 550 555 560 

Glu Glu Tyr Thr Lys Asn Lys Thr Glu Tyr Glu Glu Ala Gin Asp Ala 
565 570 575 

lie Val Lys Glu lie Val Asn lie Ser Ser Gly Tyr Val Glu Pro Met 
580 585 590 

Gin Thr Leu Asn Asp Val Leu Ala Gin Leu Asp Ala Val Val Ser Phe 
595 600 605 

Ala His Val Ser Asn Gly Ala Pro Val Pro Tyr Val Arg Pro Ala lie 
610 615 620 


Leu Glu Lys Gly Gin Gly Arg lie lie Leu Lys Ala Ser Arg His Ala 
625 630 635 640 
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Cys Val Glu Val Gin Asp Glu He Ala Phe He Pro Asn Asp Val Tyr 
645 650 655 

Phe Glu Lys Asp Lys Gin Met Phe His He He Thr Gly Pro Asn Met 
660 665 670 

Gly Gly Lys Ser Thr Tyr He Arg Gin Thr Gly Val He Val Leu Met 
675 680 685 

Ala Gin He Gly Cys Phe Val Pro Cys Glu Ser Ala Glu Val Ser He 
690 695 700 

Val Asp Cys He Leu Ala Arg Val Gly Ala Gly Asp Ser Gin Leu Lys 
705 710 715 720 

Gly Val Ser Thr Phe Met Ala Glu Met Leu Glu Thr Ala Ser He Leu 
725 730 735 

Arg Ser Ala Thr Lys Asp Ser Leu He He He Asp Glu Leu Gly Arg 
740 745 750 

Gly Thr Ser Thr Tyr Asp Gly Phe Gly Leu Ala Trp Ala He Ser Glu 
755 • 760 765 


Tyr He Ala Thr Lys He Gly Ala Phe Cys Met Phe Ala Thr His Phe 
770 775 780 


His Glu Leu Thr Ala Leu Ala Asn Gin He Pro Thr Val Asn Asn Leu 
785 790 795 800 


His Val Thr Ala Leu Thr Thr Glu Glu Thr Leu Thr Met Leu Tyr Gin 
805 810 815 


Val Lys Lys Gly Val Cys Asp Gin Ser Phe Gly He His Val Ala Glu 
820 825 830 


Leu Ala Asn Phe Pro Lys His Val He Glu Cys Ala Lys Gin Lys Ala 
835 840 845 


Leu Glu Leu Glu Glu Phe Gin Tyr He Gly Glu Ser Gin Gly Tyr Asp 
850 855 860 


He Met Glu Pro Ala Ala Lys Lys Cys Tyr Leu Glu Arg Glu Gin Gly 
865 870 875 880 


Glu Lys He He Gin Glu Phe Leu Ser Lys Val Lys Gin Met Pro Phe 
885 890 895 
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Thr Glu Met Ser Glu Glu Asn He Thr He Lys Leu Lys Gin Leu Lys 
900 905 910 

Ala Glu Val He Ala Lys Asn Asn Ser Phe Val Asn Glu He He Ser 
915 920 925 

Arg He Lys Val Thr Thr 
930 


<210> 21 

<211> 3145 

<212> DNA 

<213> Homo sapiens 

<400> 21 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60 
gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120 
gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180 
accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 240 
tcaagaccca gggggtgatc aagtacatgg ggccggcagg agcaaagaat ctgcagagtg 300 
ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcgtcagt 360 
atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 420 
atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 480 
acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 540 
agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600 
tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660 
aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720 
aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780 
atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 840 
tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900 
aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960 
agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020 
aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080 
gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 
agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200 
aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 1260 
cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320 
tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380 
ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440 
tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 1500 
tcagtgaatt aagagaaata atgaatgact tggaaaagaa gatgcagtca acattaataa 1560 
gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620 
agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680 
actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740 
ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800 
ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860 
tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 
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catatgtacg accagccatt ttggagaaag 
ggcatgcttg tgttgaagtt caagatgaaa 
aaaaagataa acagatgttc cacatcatta 
atattcgaca aactggggtg atagtactca 
agtcagcaga agtgtccatt gtggactgca 
aattgaaagg agtctccacg ttcatggctg 
ctgcaaccaa agattcatta ataatcatag 
atggatttgg gttagcatgg gctatatcag 
gcatgtttgc aacccatttt catgaactta 
ataatctaca tgtcacagca ctcaccactg 
agaaaggtgt ctgtgatcaa agttttggga 
agcatgtaat agagtgtgct aaacagaaag 
gagaatcgca aggatatgat atcatggaac 
agcaaggtga aaaaattatt caggagttcc 
aaatgtcaga agaaaacatc aca,ataaagt 
agaataatag ctttgtaaat gaaatcattt 
cagtaatgga atgaaggtaa tattgataag 
atattaaccc tttttccata gtgttaactg 
atatttagta atattttact ttgaggacat 
gctgtaactg aggactgttt gcaattgaca 
ataaataaaa tcatgtagtt tgtgg 


gacaaggaag aattatatta aaagcatcca 1980 
ttgcatttat tcctaatgac gtatactttg 204 0 
ctggccccaa tatgggaggt aaatcaacat 2100 
tggcccaaat tgggtgtttt gtgccatgtg 2160 
tcttagcccg agtaggggct ggtgacagtc 2220 
aaatgttgga aactgcttct atcctcaggt 2280 
atgaattggg aagaggaact tctacctacg 2340 
aatacattgc aacaaagatt ggtgcttttt 2400 
ctgccttggc caatcagata ccaactgtta 24 60 
aagagacctt aactatgctt tatcaggtga 2520 
ttcatgttgc agagcttgct aatttcccta 2580 
ccctggaact tgaggagttt cagtatattg 264 0 
cagcagcaaa gaagtgctat ctggaaagag 2700 
tgtccaaggt gaaacaaatg ccctttactg 2760 
taaaacagct aaaagctgaa gtaatagcaa 2820 
cacgaataaa agttactacg tgaaaaatcc 2880 
ctattgtctg taatagtttt atattgtttt 2940 
tcagtgccca tgggctatca acttaataag 3000 
tttcaaagat ttttattttg aaaaatgaga 3060 
taggcaataa taagtgatgt gctgaatttt 3120 
3145 


<210> 22 
<211> 756 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Met Ser Phe Val Ala Gly Val lie Arg Arg Leu Asp Glu Thr Val Val 
15 10 15 

Asn Arg lie Ala Ala Gly Glu Val He Gin Arg Pro Ala Asn Ala He 
20 25 30 

Lys Glu Met He Glu Asn Cys Leu Asp Ala Lys Ser Thr Ser He Gin 
35 40 45 

Val He Val Lys Glu Gly Gly Leu Lys Leu He Gin He Gin Asp Asn 
50 55 60 

Gly Thr Gly He Arg Lys Glu Asp Leu Asp He Val Cys Glu Arg Phe 
65 70 75 80 

Thr Thr Ser Lys Leu Gin Ser Phe Glu Asp Leu Ala Ser He Ser Thr 
85 90 95 

Tyr Gly Phe Arg Gly Glu Ala Leu Ala Ser He Ser His Val Ala His 
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110 


Val Thr lie Thr Thr Lys Thr Ala Asp Gly Lys Cys Ala Tyr Arg Ala 
115 120 125 

Ser Tyr Ser Asp Gly Lys Leu Lys Ala Pro Pro Lys Pro Cys Ala Gly 
130 135 140 

Asn Gin Gly Thr Gin He Thr Val Glu Asp Leu Phe Tyr Asn lie Ala 
145 150 155 160 

Thr Arg Arg Lys Ala Leu Lys Asn Pro Ser Glu Glu Tyr Gly Lys He 
165 170 175 

Leu Glu Val Val Gly Arg Tyr Ser Val His Asn Ala Gly He Ser Phe 
180 185 190 

Ser Val Lys Lys Gin Gly Glu Thr Val Ala Asp Val Arg Thr Leu Pro 
195 200 205 

Asn Ala Ser Thr Val Asp Asn He Arg Ser He Phe Gly Asn Ala Val 
210 215 220 

Ser Arg Glu Leu He Glu He Gly Cys Glu Asp Lys Thr Leu Ala Phe 
225 230 235 240 

Lys Met Asn Gly Tyr He Ser Asn Ala Asn Tyr Ser Val Lys Lys Cys 
245 250 255 

He Phe Leu Leu Phe He Asn His Arg Leu Val Glu Ser Thr Ser Leu 
260 265 270 

Arg Lys Ala He Glu Thr Val Tyr Ala Ala Tyr Leu Pro Lys Asn Thr 
275 280 285 

His Pro Phe Leu Tyr Leu Ser Leu Glu He Ser Pro Gin Asn Val Asp 
290 295 300 

Val Asn Val His Pro Thr Lys His Glu Val His Phe Leu His Glu Glu 
305 310 315 320 

Ser He Leu Glu Arg Val Gin Gin His He Glu Ser Lys Leu Leu Gly 
325 330 335 

Ser Asn Ser Ser Arg Met Tyr Phe Thr Gin Thr Leu Leu Pro Gly Leu 
340 345 350 

Ala Gly Pro Ser Gly Glu Met Val Lys Ser Thr Thr Ser Leu Thr Ser 
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365 


Ser Ser Thr Ser Gly Ser Ser Asp Lys Val Tyr Ala His Gin Met Val 
370 375 380 


Arg Thr Asp Ser Arg Glu Gin Lys Leu Asp Ala Phe Leu Gin Pro Leu 
385 390 395 400 


Ser Lys Pro Leu Ser Ser Gin Pro Gin Ala He Val Thr Glu Asp Lys 
405 410 415 


Thr Asp He Ser Ser Gly Arg Ala Arg Gin Gin Asp Glu Glu Met Leu 
420 425 430 


Glu Leu Pro Ala Pro Ala Glu Val Ala Ala Lys Asn Gin Ser Leu Glu 
435 440 445 


Gly Asp Thr Thr Lys Gly Thr Ser Glu Met Ser Glu Lys Arg Gly Pro 
450 455 460 


Thr Ser Ser Asn Pro Arg Lys Arg His Arg Glu Asp Ser Asp Val Glu 
465 470 475 480 


Met Val Glu Asp Asp Ser Arg Lys Glu Met Thr Ala Ala Cys Thr Pro 


Arg Arg Arg He He Asn Leu Thr Ser Val Leu Ser Leu Gin Glu Glu 
500 505 510 


He Asn Glu Gin Gly His Glu Val Leu Arg Glu Met Leu His Asn His 
515 520 525 


Ser Phe Val Gly Cys Val Asn Pro Gin Trp Ala Leu Ala Gin His Gin 
530 535 540 


Thr Lys Leu Tyr Leu Leu Asn Thr Thr Lys Leu Ser Glu Glu Leu Phe 
545 550 555 560 


Tyr Gin He Leu He Tyr Asp Phe Ala Asn Phe Gly Val Leu Arg Leu 
565 570 575 


Ser Glu Pro Ala Pro Leu Phe Asp Leu Ala Met Leu Ala Leu Asp Ser 
580 585 590 


Pro Glu Ser Gly Trp Thr Glu Glu Asp Gly Pro Lys Glu Gly Leu Ala 
595 600 605 


Glu Tyr He Val Glu Phe Leu Lys Lys Lys Ala Glu Met Leu Ala Asp 
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Tyr Phe Ser Leu Glu He Asp Glu Glu Gly Asn Leu He Gly Leu Pro 
625 630 635 640 

Leu Leu He Asp Asn Tyr Val Pro Pro Leu Glu Gly Leu Pro He Phe 
645 650 655 

He Leu Arg Leu Ala Thr Glu Val Asn Trp Asp Glu Glu Lys Glu Cys 
660 665 670 

Phe Glu Ser Leu Ser Lys Glu Cys Ala Met Phe Tyr Ser He Arg Lys 
675 680 685 

Gin Tyr He Ser Glu Glu Ser Thr Leu Ser Gly Gin Gin Ser Glu Val 
690 695 700 

Pro Gly Ser He Pro Asn Ser Trp Lys Trp Thr Val Glu His He Val 
705 710 715 720 

Tyr Lys Ala Leu Arg Ser His He Leu Pro Pro Lys His Phe Thr Glu 
725 730 735 

Asp Gly Asn He Leu Gin Leu Ala Asn Leu Pro Asp Leu Tyr Lys Val 
740 745 750 

Phe Glu Arg Cys 
755 


<210> 23 

<211> 2484 

<212> DNA 

<213> Homo sapiens 

<400> 23 

cttggctctt ctggcgccaa aatgtcgttc 

acagtggtga accgcatcgc ggcgggggaa 

gagatgattg agaactgttt agatgcaaaa 

ggaggcctga agttgattca gatccaagac 

gatattgtat gtgaaaggtt cactactagt 

atttctacct atggctttcg aggtgaggct 

actattacaa cgaaaacagc tgatggaaag 

aaactgaaag cccctcctaa accatgtgct 

gacctttttt acaacatagc cacgaggaga 

gggaaaattt tggaagttgt tggcaggtat 

gttaaaaaac aaggagagac agtagctgat 

gacaatattc gctccatctt tggaaatgct 


gtggcagggg ttattcggcg gctggacgag 60 
gttatccagc ggccagctaa tgctatcaaa 120 
tccacaagta ttcaagtgat tgttaaagag 180 
aatggcaccg ggatcaggaa agaagatctg 240 
aaactgcagt cctttgagga tttagccagt 300 
ttggccagca taagccatgt ggctcatgtt 360 
tgtgcataca gagcaagtta ctcagatgga 420 
ggcaatcaag ggacccagat cacggtggag 480 
aaagctttaa aaaatccaag tgaagaatat 540 
tcagtacaca atgcaggcat tagtttctca 600 
gttaggacac tacccaatgc ctcaaccgtg 660 
gttagtcgag aactgataga aattggatgt 720 
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gaggataaaa ccctagcctt caaaatgaat 

aagaagtgca tcttcttact cttcatcaac 

aaagccatag aaacagtgta tgcagcctat 

ctcagtttag aaatcagtcc ccagaatgtg 

gttcacttcc tgcacgagga gagcatcctg 

ctcctgggct ccaattcctc caggatgtac 

ggcccctctg gggagatggt taaatccaca 

agtagtgata aggtctatgc ccaccagatg 

gatgcatttc tgcagcctct gagcaaaccc 

gaggataaga cagatatttc tagtggcagg 

ctcccagccc ctgctgaagt ggctgccaaa 

gggacttcag aaatgtcaga gaagagagga 

cgggaagatt ctgatgtgga aatggtggaa 

tgtacccccc ggagaaggat cattaacctc 

aatgagcagg gacatgaggt tctccgggag 

gtgaatcctc agtgggcctt ggcacagcat 

aagcttagtg aagaactgtt ctaccagata 

ctcaggttat cggagccagc accgctcttt 

gagagtggct ggacagagga agatggtccc 

tttctgaaga agaaggctga gatgcttgca 

gggaacctga ttggattacc ccttctgatt 

cctatcttca ttcttcgact agccactgag 

gaaagcctca gtaaagaatg cgctatgttc 

gagtcgaccc tctcaggcca gcagagtgaa 

tggactgtgg aacacattgt ctataaagcc 

ttcacagaag atggaaatat cctgcagctt 

gagaggtgtt aaatatggtt atttatgcac 

cgatacaaag tgttgtatca aagtgtgata 

cacttaagac ttatacttgc cttctgatag 

aataaataga tgtgtcttaa cata 


ggttacatat ccaatgcaaa ctactcagtg 780 
catcgtctgg tagaatcaac ttccttgaga 840 
ttgcccaaaa acacacaccc attcctgtac 900 
gatgttaatg tgcaccccac aaagcatgaa 960 
gagcgggtgc agcagcacat cgagagcaag 1020 
ttcacccaga ctttgctacc aggacttgct 1080 
acaagtctga cctcgtcttc tacttctgga 1140 
gttcgtacag attcccggga acagaagctt 1200 
ctgtccagtc agccccaggc cattgtcaca 1260 
gctaggcagc aagatgagga gatgcttgaa 1320 
aatcagagct tggaggggga tacaacaaag 1380 
cctacttcca gcaaccccag aaagagacat 1440 
gatgattccc gaaaggaaat gactgcagct 1500 
actagtgttt tgagtctcca ggaagaaatt 1560 
atgttgcata accactcctt cgtgggctgt 1620 
caaaccaagt tataccttct caacaccacc 1680 
ctcatttatg attttgccaa ttttggtgtt 1740 
gaccttgcca tgcttgcctt agatagtcca 1800 
aaagaaggac ttgctgaata cattgttgag 1860 
gactatttct ctttggaaat tgatgaggaa 1920 
gacaactatg tgcccccttt ggagggactg 1980 
gtgaattggg acgaagaaaa ggaatgtttt 204 0 
tattccatcc ggaagcagta catatctgag 2100 
gtgcctggct ccattccaaa ctcctggaag 2160 
ttgcgctcac acattctgcc tcctaaacat 2220 
gctaacctgc ctgatctata caaagtcttt 2280 
tgtgggatgt gttcttcttt ctctgtattc 2340 
tacaaagtgt accaacataa gtgttggtag 2400 
tattccttta tacacagtgg attgattata 24 60 
2484 


<210> 24 
<211> 133 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 
15 10 15 

He lie Thr Ser Val Val Ser Val Val Lys Glu Leu He Glu Asn Ser 
20 25 30 

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 
35 40 45 ' 

Phe Asp Lys He Glu Val Arg Asp Asn Gly Glu Gly He Lys Ala Val 
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50 55 60 

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys lie Asn Ser 


His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala 
85 90 95 

Leu Gly Ser lie Cys Cys He Ala Glu Val Leu He Thr Thr Arg Thr 
100 105 110 

Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 

He Leu Ser Gin Lys 
130 


<210> 25 

<211> 426 

<212> DNA 

<213> Homo sapiens 

<400> 25 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 

ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 

aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 

tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 

caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 

tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
acttga 426 


<210> 26 
<211> 1360 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Met Ser Arg Gin Ser Thr Leu Tyr Ser Phe Phe Pro Lys Ser Pro Ala 
1 5 10 15 

Leu Ser Asp Ala Asn Lys Ala Ser Ala Arg Ala Ser Arg Glu Gly Gly 
20 25 30 

Arg Ala Ala Ala Ala Pro Gly Ala Ser Pro Ser Pro Gly Gly Asp Ala 
35 40 45 
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Ala Trp Ser Glu Ala Gly Pro Gly Pro Arg Pro Leu Ala Arg Ser Ala 
50 55 60 

Ser Pro Pro Lys Ala Lys Asn Leu Asn Gly Gly Leu Arg Arg Ser Val 
65 70 75 80 

Ala Pro Ala Ala Pro Thr Ser Cys Asp Phe Ser Pro Gly Asp Leu Val 
85 90 95 

Trp Ala Lys Met Glu Gly Tyr Pro Trp Trp Pro Cys Leu Val Tyr Asn 
100 105 110 

His Pro Phe Asp Gly Thr Phe lie Arg Glu Lys Gly Lys Ser Val Arg 
115 120 125 

Val His Val Gin Phe Phe Asp Asp Ser Pro Thr Arg Gly Trp Val Ser 
130 135 140 

Lys Arg Leu Leu Lys Pro Tyr Thr Gly Ser Lys Ser Lys Glu Ala Gin 
145 150 155 160 

Lys Gly Gly His Phe Tyr Ser Ala Lys Pro Glu lie Leu Arg Ala Met 
165 170 175 

Gin Arg Ala Asp Glu Ala Leu Asn Lys Asp Lys He Lys Arg Leu Glu 
180 185 190 

Leu Ala Val Cys Asp Glu Pro Ser Glu Pro Glu Glu Glu Glu Glu Met 
195 200 205 

Glu Val Gly Thr Thr Tyr Val Thr Asp Lys Ser Glu Glu Asp Asn Glu 
210 215 220 

He Glu Ser Glu Glu Glu Val Gin Pro Lys Thr Gin Gly Ser Arg Arg 
225 230 235 240 

Ser Ser Arg Gin He Lys Lys Arg Arg Val He Ser Asp Ser Glu Ser 
245 250 255 

Asp He Gly Gly Ser Asp Val Glu Phe Lys Pro Asp Thr Lys Glu Glu 
260 265 270 

Gly Ser Ser Asp Glu He Ser Ser Gly Val Gly Asp Ser Glu Ser Glu 
275 280 285 

Gly Leu Asn Ser Pro Val Lys Val Ala Arg Lys Arg Lys Arg Met Val 
290 295 300 
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Thr Gly Asn Gly Ser leu Lys Arg Lys Ser Ser Arg Lys Glu Thr Pro 
305 310 315 320 

Ser Ala Thr Lys Gin Ala Thr Ser He Ser Ser Glu Thr Lys Asn Thr 
325 330 335 

Leu Arg Ala Phe' Ser Ala Pro Gin Asn Ser Glu Ser Gin Ala His Val 
340 345 350 

Ser Gly Gly Gly Asp Asp Ser Ser Arg Pro Thr Val Trp Tyr His Glu 
355 360 365 

Thr Leu Glu Trp Leu Lys Glu Glu Lys Arg Arg Asp Glu His Arg Arg 
370 375 380 

Arg Pro Asp His Pro Asp Phe Asp Ala Ser Thr Leu Tyr Val Pro Glu 
385 390 395 400 

Asp Phe Leu Asn Ser Cys Thr Pro Gly Met Arg Lys Trp Trp. Gin He 
405 410 415 

Lys Ser Gin Asn Phe Asp Leu Val He Cys Tyr Lys Val Gly Lys Phe 
420 425 430 

Tyr Glu Leu Tyr His Met Asp Ala Leu He Gly Val Sex Glu Leu Gly 
435 440 445 

Leu Val Phe Met Lys Gly Asn Trp Ala His Ser Gly Phe Pro Glu lie 
450 455 460 

Ala Phe Gly Arg Tyr Ser Asp Ser Leu Val Gin Lys Gly Tyr Lys Val 
465 470 475 480 

Ala Arg Val Glu Gin Thr Glu Thr Pro Glu Met Met Glu Ala Arg Cys 
485 490 495 

Arg Lys Met Ala His He Ser Lys Tyr Asp Arg Val Val Arg Arg Glu 
500 505 510 

He Cys Arg He He Thr Lys Gly Thr Gin Thr Tyr Ser Val Leu Glu 
515 520 525 

Gly Asp Pro Ser Glu Asn Tyr Ser Lys Tyr Leu Leu Ser Leu Lys Glu 
530 535 540 

Lys Glu Glu Asp Ser Ser Gly His Thr Arg Ala Tyr Gly Val Cys Phe 
545 550 555 560 
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Val Asp Thr Ser Leu Gly Lys Phe Phe lie Gly Gin Phe Ser Asp Asp 
565 570 575 


Arg His Cys Ser Arg Phe Arg Thr Leu Val Ala His Tyr Pro Pro Val 
580 585 590 


Gin Val Leu Phe Glu Lys Gly Asn Leu Ser Lys Glu Thr Lys Thr lie 
595 600 605 


Leu Lys Ser Ser Leu Ser Cys Ser Leu Gin Glu Gly Leu lie Pro Gly 
610 615 620 


Ser Gin Phe Trp Asp Ala Ser Lys Thr Leu Arg Thr Leu Leu Glu Glu 
625 630 635 640 


Glu Tyr Phe Arg Glu Lys Leu Ser Asp Gly lie Gly Val Met Leu Pro 
645 650 655 


Gin Val Leu Lys Gly Met Thr Ser Glu Ser Asp Ser He Gly Leu Thr 
660 665 670 


Pro Gly Glu Lys Ser Glu Leu Ala Leu Ser Ala Leu Gly Gly Cys Val 
675 1 680 685 


Phe Tyr Leu Lys Lys Cys Leu He Asp Gin Glu Leu Leu Ser Met Ala 
690 695 700 


Asn Phe Glu Glu Tyr He Pro Leu Asp Ser Asp Thr Val Ser Thr Thr 
705 710 715 720 


Arg Ser Gly Ala He Phe Thr Lys Ala Tyr Gin Arg Met Val Leu Asp 
725 730 735 


Ala Val Thr Leu Asn Asn Leu Glu He Phe Leu Asn Gly Thr Asn Gly 
740 745 750 


Ser Thr Glu Gly Thr Leu Leu Glu Arg Val Asp Thr Cys His Thr Pro 
755 760 765 


Phe Gly Lys Arg Leu Leu Lys Gin Trp Leu Cys Ala Pro Leu Cys Asn 
770 775 780 


His Tyr Ala He Asn Asp Arg Leu Asp Ala He Glu Asp Leu Met Val 
785 790 795 800 


Val Pro Asp Lys He Ser Glu Val Val Glu Leu Leu Lys Lys Leu Pro 
805 810 815 
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Asp Leu Glu Arg Leu Leu Ser Lys lie His Asn Val Gly Ser Pro Leu 
820 825 830 

Lys Ser Gin Asn His Pro Asp Ser Arg Ala He Met Tyr Glu Glu Thr 
835 840 845 

Thr Tyr Ser Lys Lys Lys He He Asp Phe Leu Ser Ala Leu Glu Gly 
850 855 860 

Phe Lys Val Met Cys Lys He He Gly He Met Glu Glu Val Ala Asp 
865 870 875 880 

Gly Phe Lys Ser Lys He Leu Lys Gin Val He Ser Leu Gin Thr Lys 
885 890 895 

Asn Pro Glu Gly Arg Phe Pro Asp Leu Thr Val Glu Leu Asn Arg Trp 
900 905 910 

Asp Thr Ala Phe Asp His Glu Lys Ala Arg Lys Thr Gly Leu He Thr 
915 920 925 

Pro Lys Ala Gly Phe Asp Ser Asp Tyr Asp Gin Ala Leu Ala Asp He 
930 935 940 

Arg Glu Asn Glu Gin Ser Leu Leu Glu Tyr Leu Glu Lys Gin Arg Asn 
945 950 955 960 

Arg He Gly Cys Arg Thr He Val Tyr Trp Gly He Gly Arg Asn Arg 
965 970 975 

Tyr Gin Leu Glu He Pro Glu Asn Phe Thr Thr Arg Asn Leu Pro Glu 
980 985 990 

Glu Tyr Glu Leu Lys Ser Thr Lys Lys Gly Cys Lys Arg Tyr Trp Thr 
995 1000 1005 

Lys Thr He Glu Lys Lys Leu Ala Asn Leu He Asn Ala Glu Glu Arg 
1010 1015 1020 

Arg Asp Val Ser Leu Lys Asp Cys Met Arg Arg Leu Phe Tyr Asn Phe 
1025 1030 1035 1040 

Asp Lys Asn Tyr Lys Asp Trp Gin Ser Ala Val Glu Cys He Ala Val 
1045 1050 1055 


Leu Asp Val Leu Leu Cys Leu Ala Asn Tyr Ser Arg Gly Gly Asp Gly 
1060 1065 1070 
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Pro Met Cys Arg Pro Val lie Leu Leu Pro Glu Asp Thr Pro Pro Phe 

1075 1080 1085 

Leu Glu Leu Lys Gly Ser Arg His Pro Cys lie Thr Lys Thr Phe Phe 

1090 1095 1100 

Gly Asp Asp Phe lie Pro Asn Asp lie Leu lie Gly Cys Glu Glu Glu 

1105 1110 1115 1120 

Glu Gin Glu Asn Gly Lys Ala Tyr Cys Val Leu Val Thr Gly Pro Asn 
1125 1130 1135 

Met Gly Gly Lys Ser Thr Leu Met Arg Gin Ala Gly Leu Leu Ala Val 

1140 1145 1150 

Met Ala Gin Met Gly Cys Tyr Val Pro Ala Glu Val Cys Arg Leu Thr 

1155 1160 1165 

Pro lie Asp Arg Val Phe Thr Arg Leu Gly Ala Ser Asp Arg lie Met 

1170 1175 1180 

Ser Gly Glu Ser Thr Phe Phe Val Glu Leu Ser Glu Thr Ala Ser lie 

1185 1190 1195 1200 

Leu Met His Ala Thr Ala His Ser Leu Val Leu Val Asp Glu Leu Gly 
1205 1210 1215 

Arg Gly Thr Ala Thr Phe Asp Gly Thr Ala He Ala Asn Ala Val Val 

1220 1225 1230 

Lys Glu Leu Ala Glu Thr He Lys Cys Arg Thr Leu Phe Ser Thr His 

1235 1240 1245 

Tyr His Ser Leu Val Glu Asp Tyr Ser Gin Asn Val Ala Val Arg Leu 

1250 1255 1260 

Gly His Met Ala Cys Met Val Glu Asn Glu Cys Glu Asp Pro Ser Gin 

1265 1270 1275 1280 

Glu Thr He Thr Phe Leu Tyr Lys Phe He Lys Gly Ala Cys Pro Lys 
1285 1290 1295 

Ser Tyr Gly Phe Asn Ala Ala Arg Leu Ala Asn Leu Pro Glu Glu Val 

1300 1305 1310 


He Gin Lys Gly His Arg Lys Ala Arg Glu Phe Glu Lys Met Asn Gin 
1315 1320 1325 
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Ser Leu Arg Leu Phe Arg Glu Val Cys Leu Ala Ser Glu Arg Ser Thr 
1330 1335 1340 

Val Asp Ala Glu Ala Val His Lys Leu Leu Thr Leu lie Lys Glu Leu 

1345 1350 1355 1360 


<210> 27 

<211> 4244 

<212> DKA 

<213> Homo sapiens 

<400> 27 

gccgcgcggt agatgcggtg cttttaggag 
ggctgtcggt atgtcgcgac agagcaccct 
gagtgatgcc aacaaggcct cggccagggc 
ccccggggcc tctccttccc caggcgggga 
caggcccttg gcgcgctccg cgtcaccgcc 
gagatcggta gcgcctgctg cccccaccag 
ggccaagatg gagggttacc cctggtggcc 
aacattcatc cgcgagaaag ggaaatcagt 
cccaacaagg ggctgggtta gcaaaaggct 
ggaagcccag aagggaggtc atttttacag 
acgtgcagat gaagccttaa ataaagacaa 
tgagccctca gagccagaag aggaagaaga 
taagagtgaa gaagataatg aaattgagag 
atctaggcga agtagccgcc aaataaaaaa 
cattggtggc tctgatgtgg aatttaagcc 
aataagcagt ggagtggggg atagtgagag 
tcgaaagcgg aagagaatgg tgactggaaa 
ggaaacgccc tcagccacca aacaagcaac 
gagagctttc tctgcccctc aaaattctga 
tgacagtagt cgccctactg tttggtatca 
gagaagagat gagcacagga ggaggcctga 
tgtgcctgag gatttcctca attcttgtac 
gtctcagaac tttgatcttg tcatctgtta 
catggatgct cttattggag tcagtgaact 
ccattctggc tttcctgaaa ttgcatttgg 
ctataaagta gcacgagtgg aacagactga 
aaagatggca catatatcca agtatgatag 
taccaagggt acacagactt acagtgtgct 
gtatcttctt agcctcaaag aaaaagagga 
tgtgtgcttt gttgatactt cactgggaaa 
ccattgttcg agatttagga ctctagtggc 


ctccgtccga cagaacggtt gggccttgcc 60 
gtacagcttc ttccccaagt ctccggcgct 120 
ctcacgcgaa ggcggccgtg ccgccgctgc 180 
tgcggcctgg agcgaggctg ggcctgggcc 240 
caaggcgaag aacctcaacg gagggctgcg 300 
ttgtgacttc tcaccaggag atttggtttg 360 
ttgtctggtt tacaaccacc cctttgatgg 420 
ccgtgttcat gtacagtttt ttgatgacag 480 
tttaaagcca tatacaggtt caaaatcaaa 540 
tgcaaagcct gaaatactga gagcaatgca 600 
gattaagagg cttgaattgg cagtttgtga 660 
gatggaggta ggcacaactt acgtaacaga 720 
tgaagaggaa gtacagccta agacacaagg 780 
acgaagggtc atatcagatt ctgagagtga 8 40 
agacactaag gaggaaggaa gcagtgatga 900 
tgaaggcctg aacagccctg tcaaagttgc 960 
tggctctctt aaaaggaaaa gctctaggaa 1020 
tagcatttca tcagaaacca agaatacttt 1080 
atcccaagcc cacgttagtg gaggtggtga 1140 
tgaaacttta gaatggctta aggaggaaaa 1200 
tcaccccgat tttgatgcat ctacactcta 1260 
tcctgggatg aggaagtggt ggcagattaa 1320 
caaggtgggg aaattttatg agctgtacca 1380 
ggggctggta ttcatgaaag gcaactgggc 1440 
ccgttattca gattccctgg tgcagaaggg 1500 
gactccagaa atgatggagg cacgatgtag 1560 
agtggtgagg agggagatct gtaggatcat 1620 
ggaaggtgat ccctctgaga actacagtaa 1680 
agattcttct ggccatactc gtgcatatgg 17 40 
gtttttcata ggtcagtttt cagatgatcg 1800 
acactatccc ccagtacaag ttttatttga 1860 
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aaaaggaaat ctctcaaagg aaactaaaac aattctaaag agttcattgt cctgttctct 1920 
tcaggaaggt ctgatacccg gctcccagtt ttgggatgca tccaaaactt tgagaactct 1980 
ccttgaggaa gaatatttta gggaaaagct aagtgatggc attggggtga tgttacccca 2040 
ggtgcttaaa ggtatgactt cagagtctga ttccattggg ttgacaccag. gagagaaaag 2100 
tgaattggcc cfcctctgctc taggtggttg tgtcttctac ctcaaaaaat gccttattga 2160 
tcaggagctt ttatcaatgg ctaattttga agaatatatt cccttggatt ctgacacagt 2220 
cagcactaca agatctggtg ctatcttcac caaagcctat caacgaatgg tgctagatgc 2280 
agtgacatta aacaacttgg agatttttct gaatggaaca aatggttcta ctgaaggaac 2340 
cctactagag agggttgata cttgccatac tccttttggt aagcggctcc taaagcaatg 2400 
gctttgtgcc ccactctgta accattatgc tattaatgat cgtctagatg ccatagaaga 24 60 
cctcatggtt gtgcctgaca aaatctccga agttgtagag cttctaaaga agcttccaga 2520 
tcttgagagg ctactcagta aaattcataa tgttgggtct cccctgaaga gtcagaacca 2580 
cccagacagc agggctataa tgtatgaaga aactacatac agcaagaaga agattattga 2640 
ttttctttct gctctggaag gattcaaagt aatgtgtaaa attataggga tcatggaaga 2700 
agttgctgat ggttttaagt ctaaaatcct taagcaggtc atctctctgc agacaaaaaa 2760 
tcctgaaggt cgttttcctg atttgactgt agaattgaac cgatgggata cagcctttga 2820 
ccatgaaaag gctcgaaaga ctggacttat tactcccaaa gcaggctttg actctgatta 2880 
tgaccaagct cttgctgaca taagagaaaa tgaacagagc ctcctggaat acctagagaa 2940 
acagcgcaac agaattggct gtaggaccat agtctattgg gggattggta ggaaccgtta 3000 
ccagctggaa attcctgaga atttcaccac tcgcaatttg ccagaagaat acgagttgaa 3060 
atctaccaag aagggctgta aacgatactg gaccaaaact attgaaaaga agttggctaa 3120 
tctcataaat gctgaagaac ggagggatgt atcattgaag gactgcatgc ggcgactgtt 3180 
ctataacttt gataaaaatt acaaggactg gcagtctgct gtagagtgta tcgcagtgtt 3240 
ggatgtttta ctgtgcctgg ctaactatag tcgagggggt gatggtccta tgtgtcgccc 3300 
agtaattctg ttgccggaag ataccccccc cttcttagag cttaaaggat cacgccatcc 3360 
ttgcattacg aagacttttt ttggagatga ttttattcct aatgacattc taataggctg 3420 
tgaggaagag gagcaggaaa atggcaaagc ctattgtgtg cttgttactg gaccaaatat 3480 
ggggggcaag tctacgctta tgagacaggc tggcttatta gctgtaatgg cccagatggg 3540 
ttgttacgtc cctgctgaag tgtgcaggct cacaccaatt gatagagtgt ttactagact 3600 
tggtgcctca gacagaataa tgtcaggtga aagtacattt tttgttgaat taagtgaaac 3660 
tgccagcata ctcatgcatg caacagcaca ttctctggtg cttgtggatg aattaggaag 3720 
aggtactgca acatttgatg ggacggcaat agcaaatgca gttgttaaag aacttgotga 3780 
gactataaaa tgtcgtacat tattttcaac tcactaccat tcattagtag aagattattc 3840 
tcaaaatgtt gctgtgcgcc taggacatat ggcatgcatg gtagaaaatg aatgtgaaga 3900 
ccccagccag gagactatta cgttcctcta taaattcatt aagggagctt gtcctaaaag 3960 
ctatggcttt aatgcagcaa ggcttgctaa tctcccagag gaagttattc aaaagggaca 4020 
tagaaaagca agagaatttg agaagatgaa tcagtcacta cgattatttc gggaagtttg 4080 
cctggctagt gaaaggtcaa ctgtagatgc tgaagctgtc cataaattgc tgactttgat 4140 
taaggaatta tagactgact acattggaag ctttgagttg acttctgaca aaggtggtaa 4200 
attcagacaa cattatgatc taataaactt tattttttaa aaat 4244 


<210> 28 

<211> 1128 

<212> PRT 

<213> Homo sapiens 

<400> 28 


36 


WO 02/054856 


PCTYUS01/00934 


Met Ser Arg Arg Lys Pro Ala Ser Gly Gly Leu Ala Ala Ser Ser Ser 
15 10 15 

Ala Pro Ala Arg Gin Ala Val Leu Ser Arg Phe Phe Gin Ser Thr Gly 
20 25 30 

Ser Leu Lys Ser Thr Ser Ser Ser Thr Gly Ala Ala Asp Gin Val Asp 
35 40 45 

Pro Gly Ala Ala Ala Ala Ala Ala Pro Pro Ala Pro Ala Phe Pro Pro 
50 55 60 

Gin Leu Pro Pro His Val Ala Thr Glu lie Asp Arg Arg Lys Lys Arg 
65 70 75 80 

Pro Leu Glu Asn Asp Gly Pro Val Lys Lys Lys Val Lys Lys Val Gin 
85 90 95 

Gin Lys Glu Gly Gly Ser Asp Leu Gly Met Ser Gly Asn Ser Glu Pro 
100 105 110 

Lys Lys Cys Leu Arg Thr Arg Asn Val Ser Lys Ser Leu Glu Lys Leu 
115 120 125 

Lys Glu Phe Cys Cys Asp Ser Ala Leu Pro Gin Ser Arg Val Gin Thr 
130 135 140 

Glu Ser Leu Gin Glu Arg Phe Ala Val Leu Pro Lys Cys Thr Asp Phe 
145 150 155 160 

Asp Asp lie Ser Leu Leu His Ala Lys Asn Ala Val Ser Ser Glu Asp 
165 170 175 

Ser Lys Arg Gin He Asn Gin Lys Asp Thr Thr Leu Phe Asp Leu Ser 
180 185 190 

Gin Phe Gly Ser Ser Asn Thr Ser His Glu Asn Leu Gin Lys Thr Ala 
195 200 205 

Ser Lys Ser Ala Asn Lys Arg Ser Lys Ser He Tyr Thr Pro Leu Glu 
210 215 220 

Leu Gin Tyr He Glu Met Lys Gin Gin His Lys Asp Ala Val Leu Cys 
225 230 235 240 

Val Glu Cys Gly Tyr Lys Tyr Arg Phe Phe Gly Glu Asp Ala Glu He 
245 250 255 
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Ala Ala Arg Glu Leu Asn lie Tyr Cys His Leu Asp His Asn Phe Met 
260 265 270 


Thr Ala Ser lie Pro Thr His Arg Leu Phe Val His Val Arg Arg Leu 
275 280 285 


Val Ala Lys Gly Tyr Lys Val Gly Val Val Lys Gin Thr Glu Thr Ala 
290 295 300 


Ala Leu Lys Ala lie Gly Asp Asn Arg Ser Ser Leu Phe Ser Arg Lys 
305 310 315 320 


Leu Thr Ala Leu Tyr Thr Lys Ser Thr Leu He Gly Glu Asp Val Asn 
325 330 335 


Pro Leu lie Lys Leu Asp Asp Ala Val Asn Val Asp Glu He Met Thr 
340 345 350 


Asp Thr Ser Thr Ser Tyr Leu Leu Cys He Ser Glu Asn Lys Glu Asn 
355 360 365 


Val Arg Asp Lys Lys Lys Gly Asn He Phe He Gly He Val Gly Val 
370 375 380 


Gin Pro Ala Thr Gly Glu Val Val Phe Asp Ser Phe Gin Asp Ser Ala 
385 390 395 400 


Ser Arg Ser Glu Leu Glu Thr Arg Met Ser Ser Leu Gin Pro Val Glu 
405 410 415 


Leu Leu Leu Pro Ser Ala Leu Ser Glu Gin Thr Glu Ala Leu lie His 
420 425 430 


Arg Ala Thr Ser Val Ser Val Gin Asp Asp Arg He Arg Val Glu Arg 
435 440 445 


Met Asp Asn He Tyr Phe Glu Tyr Ser His Ala Phe Gin Ala Val Thr 
450 455 460 


Glu Phe Tyr Ala Lys Asp Thr Val Asp He Lys Gly Ser Gin He He 
465 470 475 480 


Ser Gly He Val Asn Leu Glu Lys Pro Val He Cys Ser Leu Ala Ala 
485 490 495 


He He Lys Tyr Leu Lys Glu Phe Asn Leu Glu Lys Met Leu Ser Lys 
500 505 510 


38 


WO 02/054856 


PCT/US01/00934 


Pro Glu Asn Phe Lys Gin Leu Ser Ser Lys Met Glu Phe Met Thr lie 
515 520 525 

Asn Gly Thr Thr Leu Arg Asn Leu Glu He Leu Gin Asn Gin Thr Asp 
530 535 540 

Met Lys Thr Lys Gly Ser Leu Leu Trp Val Leu Asp His Thr Lys Thr 
545 550 555 560 

Ser Phe Gly Arg Arg Lys Leu Lys Lys Trp Val Thr Gin Pro Leu Leu 
565 570 575 

Lys Leu Arg Glu He Asn Ala Arg Leu Asp Ala Val Ser Glu Val Leu 
580 585 590 

His Ser Glu Ser Ser Val Phe Gly Gin He Glu Asn His Leu Arg Lys 
595 600 605 

Leu Pro Asp He Glu Arg Gly Leu Cys Ser He Tyr His Lys Lys Cys 
610 615 620 

Ser Thr Gin Glu Phe Phe Leu He Val Lys Thr Leu Tyr His Leu Lys 
625 630 635 640 

Ser Glu Phe Gin Ala He He Pro Ala Val Asn Ser His He Gin Ser 
645 650 655 

Asp Leu Leu Arg Thr Val He Leu Glu He Pro Glu Leu Leu Ser Pro 
660 665 670 

Val Glu His Tyr Leu Lys He Leu Asn Glu Gin Ala Ala Lys Val Gly 
675 680 685 

Asp Lys Thr Glu Leu Phe Lys Asp Leu Ser Asp Phe Pro Leu He Lys 
690 695 700 

Lys Arg Lys Asp Glu He Gin Gly Val He Asp Glu He Arg Met His 
705 710 715 720 

Leu Gin Glu He Arg Lys He Leu Lys Asn Pro Ser Ala Gin Tyr Val 
725 730 735 

Thr Val Ser Gly Gin Glu Phe Met He Glu He Lys Asn Ser Ala Val 
740 745 750 

Ser Cys He Pro Thr Asp Trp Val Lys Val Gly Ser Thr Lys Ala Val 
755 760 765 
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Ser Arg Phe His Ser Pro Phe lie Val Glu Asn Tyr Arg His Leu Asn 
770 775 780 

Gin Leu Arg Glu Gin Leu Val Leu Asp Cys Ser Ala Glu Trp Leu Asp 
785 790 795 ~ 800 

Phe Leu Glu Lys Phe Ser Glu His Tyr His Ser Leu Cys Lys Ala Val 
805 810 815 

His His Leu Ala Thr Val Asp Cys lie Phe Ser Leu Ala Lys Val Ala 
820 825 830 

Lys Gin Gly Asp Tyr Cys Arg Pro Thr Val Gin Glu Glu Arg Lys lie 
835 840 845 

Val lie Lys Asn Gly Arg His Pro Val He Asp Val Leu Leu Gly Glu 
850 855 860 

Gin Asp Gin Tyr Val Pro Asn Asn Thr Asp Leu Ser Glu Asp Ser Glu 
865 870 875 880 

Arg Val Met He He Thr Gly Pro Asn Met Gly Gly Lys Ser Ser Tyr 
885 890 895 

He Lys Gin Val Ala Leu He Thr He Met Ala Gin He Gly Ser Tyr 
900 905 910 

Val Pro Ala Glu Glu Ala Thr He Gly He Val Asp Gly He Phe Thr 
915 920 925 

Arg Met Gly Ala Ala Asp Asn He Tyr Lys Gly Arg Ser Thr Phe Met 
930 935 940 

Glu Glu Leu Thr Asp Thr Ala Glu He He Arg Lys Ala Thr Ser Gin 
945 950 955 960 

Ser Leu Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr His Asp 
965 970 975 

Gly He Ala He Ala Tyr Ala Thr Leu Glu Tyr Phe He Arg Asp Val 
980 985 990 

Lys Ser Leu Thr Leu Phe Val Thr His Tyr Pro Pro Val Cys Glu Leu 
995 1000 1005 

Glu Lys Asn Tyr Ser His Gin Val Gly Asn Tyr His Met Gly Phe Leu 
1010 1015 1020 
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Val Ser Glu Asp Glu Ser Lys Leu Asp Pro Gly Ala Ala Glu Gin Val 
1025 1030 1035 1040 

Pro Asp Phe Val Thr Phe Leu Tyr Gin He Thr Arg Gly He Ala Ala 
1045 1050 1055 

Arg Ser Tyr Gly Leu Asn Val Ala Lys Leu Ala Asp Val Pro Gly Glu 
1060 1065 1070 

He Leu Lys Lys Ala Ala His Lys Ser Lys Glu Leu Glu Gly Leu He 
1075 1080 1085 

Asn Thr Lys Arg Lys Arg Leu Lys Tyr Phe Ala Lys Leu Trp Thr Met 
1090 1095 1100 

His Asn Ala Gin Asp Leu Gin Lys Trp Thr Glu Glu Phe Asn Met Glu 
1105 1110 1115 1120 

Glu Thr Gin Thr Ser Leu Leu His 
1125 


<210> 29 

<211> 4374 

<212> DNA 

<213> Homo sapiens 

<400> 29 

gggcacgagc cctgccatgt ctcgccggaa 
ctcagcccct gcgaggcaag cggttttgag 
atccacctcc tcctccacag gtgcagccga 
agcgccccca gcgcccgcct tcccgcccca 
cagaagaaag aagagaccat tggaaaatga 
ccaacaaaag gaaggaggaa gtgatctggg 
tctgaggacc aggaatgttt caaagtctct 
tgcccttcct caaagtagag tccagacaga 
aaaatgtact gattttgatg atatcagtct 
agattcgaaa cgtcaaatta atcaaaagga 
atcatcaaat acaagtcatg aaaatttaca 
gtccaaaagc atctatacgc cgctagaatt 
agatgcagtt ttgtgtgtgg aatgtggata 
gattgcagcc cgagagctca atatttattg 
tatacctact cacagactgt ttgttcatgt 
gggagttgtg aagcaaactg aaactgcagc 
actcttttcc cggaaattga ctgcccttta 
gaatccccta atcaagctgg atgatgctgt 
taccagctat cttctgtgca tctctgaaaa 
caacattttt attggcattg tgggagtgca 


gcctgcgtcg ggcggcctcg ctgcctccag 60 
ccgattcttc cagtctacgg gaagcctgaa 120 
ccaggtggac cctggcgctg cagcggccgc 180 
gctgccgccg cacgtagcta cagaaattga 240 
tgggcctgtt aaaaagaaag taaagaaagt 300 
aatgtctggc aactctgagc caaagaaatg 360 
ggaaaaattg aaagaattct gctgcgattc 420 
atctctgcag gagagatttg cagttctgcc 480 
tctacacgca aagaatgcag tttcttctga 540 
cacaacactt tttgatctca gtcagtttgg 600 
gaaaactgct tccaaatcag ctaacaaacg 660 
acaatacata gaaatgaagc agcagcacaa 720 
taagtataga ttctttgggg aagatgcaga 780 
ccatttagat cacaacttta tgacagcaag 840 
acgccgcctg gtggcaaaag gatataaggt 900 
attaaaggcc attggagaca acagaagttc 960 
tacaaaatct acacttattg gagaagatgt 1020 
aaatgttgat gagataatga ctgatacttc 1080 
taaggaaaat gttagggaca aaaaaaaggg 1140 
gcctgccaca ggcgaggttg tgtttgatag 1200 
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tttccaggac tctgcttctc gttcagagct 
agagctgctg cttccttcgg ccttgtccga 
atctgttagt gtgcaggatg acagaattcg 
atacagccat gctttccagg cagttacaga 
aggttctcaa attatttctg gcattgttaa 
tgccatcata aaatacctca aagaattcaa 
ttttaaacag ctatcaagta aaatggaatt 
•tctggaaatc ctacagaatc agactgatat 
agaccacact aaaacttcat ttgggagacg 
ccttaaatta agggaaataa atgcccggct 
atctagtgtg tttggtcaga tagaaaatca 
actctgtagc atttatcaca aaaaatgttc 
tttatatcac ctaaagtcag aatttcaagc 
gtcagacttg ctccggaccg ttattttaga 
ttacttaaag atactcaatg aacaagctgc 
agacctttct gacttccctt taataaaaaa 
cgagatccga atgcatttgc aagaaatacg 
tgtgacagta tcaggacagg agtttatgat 
accaactgat tgggtaaagg ttggaagcac 
tattgtagaa aattacagac atctgaatca 
tgctgaatgg cttgattttc tagagaaatt 
agtgcatcac ctagcaactg ttgactgcat 
agattactgc agaccaactg tacaagaaga 
ccctgtgatt gatgtgttgc tgggagaaca 
atcagaggac tcagagagag taatgataat 
ctacataaaa caagttgcat tgattaccat 
agaagaagcg acaattggga ttgtggatgg 
tatatataaa ggacggagta catttatgga 
aaaagcaaca tcacagtcct tggttatctt 
tgatggaatt gccattgcct atgctacact 
aaccctgttt gtcacccatt atccgccagt 
ggtggggaat taccacatgg gattcttggt 
cgcagcagaa caagtccctg attttgtcac 
agcaaggagt tatggattaa atgtggctaa 
gaaagcagct cacaagtcaa aagagctgga 
caagtatttt gcaaagttat ggacgatgca 
ggagttcaac atggaagaaa cacagacttc 
tgaacaaaaa atggagaatt aaaaatacca 
tatctttgtg tgacatgtga gcataaaatt 
agaggttttt ctgaagacag tctttttcaa 
aacactcttg aatagacttc cactttgtaa 
aaagccttaa gtggcagaat ataattccca 
tgatattttt atttgtttca gttcagataa 
atccattgaa ctaaaataat tttattatgc 
tttttataag tagaaagaat tggccaggca 
tgggaggcca aggtaggcag atcacctgag 
tggcaaaacc ccatctttac taaaaatata 
ttagctgggc atggtggcgc acacctgtag 


agaaacccgg atgtcaagcc tgcagccagt 1260 
gcaaacagag gcgctcatcc acagagccac 1320 
agtcgaaagg atggataaca tttattttga 1380 
gttttatgca aaagatacag ttgacatcaa 1440 
cttagagaag cctgtgattt gctctttggc 1500 
cttggaaaag atgotctcca aacctgagaa 1560 
tatgacaatt aatggaacaa cattaaggaa 1620 
gaaaaccaaa ggaagtttgc tgtgggtttt 1680 
gaagttaaag aagtgggtga cccagccact 1740 
tgatgctgta tcggaagttc tccattcaga 1800 
tctacgtaaa ttgcccgaca tagagagggg 1860 
tacccaagag ttcttcttga ttgtcaaaac 1920 
aataatacct gctgttaatt cccacattca 1980 
aattcctgaa ctcctcagtc cagtggagca 2040 
caaagttggg gataaaactg aattatttaa 2100 
gaggaaggat gaaattcaag gtgttattga 2160 
aaaaatacta aaaaatcctt ctgcacaata 2220 
agaaataaag aactetgctg tatcttgtat 2280 
aaaagctgtg agccgctttc actctccttt 2340 
gctccgggag cagctagtcc ttgactgcag 2400 
cagtgaacat tatcactcct tgtgtaaagc 24 60 
tttctccctg gccaaggtcg ctaagcaagg 2520 
aagaaaaatt gtaataaaaa atggaaggca 2580 
ggatcaatat gtcccaaata atacagattt 2640 
taccggacca aacatgggtg gaaagagctc 2700 
catggctcag attggctcct atgttcctgc 27 60 
cattttcaca aggatgggtg ctgcagacaa 2820 
agaactgact gacacagcag aaataatcag 2880 
ggatgaacta ggaagaggga cgagcactca 2940 
tgagtatttc atcagagatg tgaaatcctt 3000 
ttgtgaacta gaaaaaaatt actcacacca 3060 
cagtgaggat gaaagcaaac tggatccagg 3120 
cttcctttac caaataacta gaggaattgc 3180 
actagcagat gttcctggag aaattttgaa 3240 
aggattaata aatacgaaaa gaaagagact 3300 
taatgcacaa gacctgcaga agtggacaga 3360 
tcttcttcat taaaatgaag actacatttg 3420 
actgtacaaa ataactctcc agtaacagcc 3480 
atgaccatgg tatattccta ttggaaacag 3540 
gtttctgtct tcctaacttt tctacgtata 3600 
ttagaaaatt ttatggacag taagtccagt 3660 
agcttttgga gggtgatata aaaatttact 3720 
ttggcaactg ggtgaatctg gcaggaatct 3780 
aaccagttta tccaccaaga acataagaat 3840 
tggtggctca tgcctgtaat cccagcactt 3900 
gtcaggagtt caagaccagc ctggccaaca 3960 
aagtacatct ctactaaaaa tacgaaaaaa 4020 
tcccagctac tccggaggct gaggcaggag 4 080 
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aatctcttga acctgggagg cggaggttgc aatgagccga gatcacgtca ctgcactcca 4140 

gcttgggcaa cagagcaaga ctccatctca aaaaagaaaa aagaaaagaa atagaattat 4200 

caagctttta aaaactagag cacagaagga ataaggtcat gaaatttaaa aggttaaata 4260 

ttgtcatagg attaagcagt ttaaagattg ttggatgaaa ttatttgtca ttcattcaag 4320 

taataaatat ttaatgaata cttgctataa aaaaaaaaaa aaaaaaaaaa aaaa 4374 


<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 30 

gatatctcca ctgacgtaag 20 

<210> 31 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 31 

tgttgccggt cttgcgatg 19 

<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
primer 

<400> 32 

cccgatctag taacatagat g 21 

<210> 33 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 33 

cagtctggat cgcgaaaact g 


<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 34 

ggtgattacc gacgaaaacg 


<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence '.oligonucleotide 
primer 


<400> 35 

agtgaagggc gaacagttcc 


<210> 36 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 


<400> 36 

gagtattgcc aacgaacc 
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<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 37 

gtatcaccgc gtctttgatc 


<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 38 

cgaaacgcag cacgatacg 


<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 39 

gttcaacgct gacatcacc 


<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
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<400> 40 

catgttcatc tgcccagtcg 


<210> 41 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 41 

gctttggaca taccatcc 


<210> 42 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

<400> 42 

caccgaagtt catgccag 


<210> 43 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
primer 

<400> 43 

tgactacttt tgacttcagc c 


<210> 44 
<211> 22 
<212> DNA 


WO 02/054856 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 44 

aaccattcaa catttttaac cc 22 
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